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METHODS FOR GENERATING HYPERMUTABLE MICROBES 

5 This application claims the benefit of provisional application S.N. 

60/181 ,929 filed February 1 1 , 2000. 

FIELD OF THE INVENTION 

The invention is related to the area of mismatch repair genes. In 
10 particular it is related to the field of in situ mutagenesis of single celled 
organisms. 

BACKGROUND OF THE INVENTION 

Within the past four years, the genetic cause of the Hereditary 

1 5 Nonpolyposis Colorectal Cancer Syndrome (HNPCC), also known as 
Lynch syndrome II, has been ascertained for the majority of kindred's 
affected with the disease (14). The molecular basis of HNPCC involves 
genetic instability resulting from defective mismatch repair (MMR). 
Several genes have been identified in humans that encode for proteins and 

20 appear to participate in the MMR process, including the mutS homologs 
GTBP, hMSH2, and hMSH3 and the miuL homologs hMLHK hMLH3, 
hPMSJ, and hPMS2 (4,9,11,1 7,19,22,24,38). Germline mutations in four 
of these genes (hMSH2, hMLHi, hPMSJ, and hPMS2) have been identified 
in HNPCC kindred's (13). Though the mutator defect that arises from the 

25 MMR deficiency can affect any DNA sequence, microsatellite sequences 
are particularly sensitive to MMR abnormalities (14,-25, 27, 29). 
Microsatellite instability (MI) is therefore a useful indicator of defective 
MMR. In addition to its occurrence in virtually all rumors arising in 
HNPCC patients, MI is found in a small fraction of sporadic tumors with 

30 distinctive molecular and phenotypic properties (13). 

HNPCC is inherited in an autosomal dominant fashion, so that the 
normal cells of affected family members contain one mutant allele of the 
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relevant MMR gene (inherited from an affected parent) and one wild-type 
allele (inherited from the unaffected parent). During the early stages of 
tumor development, however, the wild-type allele is inactivated through a 
somatic mutation, leaving the cell with no functional MMR gene and 
5 resulting in a profound defect in MMR activity. Because a somatic 
mutation in addition to a germ-line mutation is required to generate 
defective MMR in the tumor cells, this mechanism is generally referred to 
as one involving two hits, analogous to the biallelic inactivation of tumor 
suppressor genes that initiate other hereditary cancers. In line with this 

1 0 two-hit mechanism, the non-neoplastic cells Of HNPCC patients generally 
retain near normal levels of MMR activity due to the presence of the 
wild-type allele (1 1,13, 24). In addition, similar findings are observed in 
other diploid organisms (2, 5, 8). 

The ability to alter signal transduction pathways by manipulation of 

] 5 a gene product's function, either by over-expression of the wild type 

protein or a fragment thereof, or by introduction of mutations into specific 
protein domains of the protein, the so-called dominant-negative inhibitory 
mutant, were described over a decade ago in the yeast system 
Saccharomyces cerevisiae by Herskowitz (Nature 329:219-222, 1987). It 

20 has been demonstrated that over-expression of wild type gene products can 
result in a similar, dominant-negative inhibitory phenotype due most likely 
to the "saturating-out" of a factor, such as a protein, that is present at low 
levels and necessary for activity; removal of the protein by binding to a 
high level of its cognate partner results in the same net effect, leading to 

25 inactivation of the protein and the associated signal transduction pathway. 
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Recently, work done by Nicolaides et.al. (52) has demonstrated the 
utility of introducing dominant negative inhibitory mismatch repair mutants 
into mammalian cells to confer global DNA hypermutability. There is a 
need in the art for additional techniques for generating mutations in 
5 bacteria which can be used to make strains for production, biocatalysis, 
bioremediation, and drug discovery. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a method for 
10 rendering bacterial cells hypermutable. 

It is another object of the present invention to provide genetically 
altered bacteria. 

It is yet another object of the present invention to provide a method 
to produce bacteria that are hypermutable. 
15 It is an object of the invention to provide a method to inactivate the 

process that results in hypermutable cells following strain selection. 

It is a further object of the invention to provide a method of 
mutating a gene of interest in a bacterium. 

These and other embodiments of the invention are provided by one 
20 or more of the embodiments described below. In one embodiment, a 

method is provided for making a hypermutable bacteria. A polynucleotide 
comprising a dominant negative allele of a mismatch repair gene is 
introduced into a bacterium, whereby the cell becomes hypermutable. 
Preferably the allele is under the control of an inducible transcription 
25 regulatory sequence. 

According to another aspect of the invention a homogeneous 
composition of cultured, hypermutable, bacteria is provided. The bacteria 
comprise a dominant negative allele of a mismatch repair gene. Preferably 
the allele is under the control of an inducible transcription regulatory 
30 sequence. 
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Another embodiment of the invention provides a method for 
generating a mutation in a gene of interest. A bacterial culture comprising 
the gene of interest and a dominant negative allele of a mismatch repair 
gene is grown. The cell is hypermutable. It is tested to determine whether 
5 the gene of interest harbors a mutation. Preferably the allele is under the 
control of an inducible transcription regulatory sequence. 

According to still another aspect of the invention a method for 
generating a mutation in a gene of interest is provided. A bacterium 
comprising the gene of interest and a dominant negative allele of a 
10 mismatch repair gene is grown to form a population of mutated bacteria. 
The population of mutated bacteria is cultivated under trait selection 
conditions. At least one of the cultivated bacteria is tested to determine that 
the gene of interest harbors a mutation. Preferably the allele is under the 
control of an inducible transcription regulatory sequence. 
15 Still another aspect of the invention is a method for enhancing the 

mutation rate of a bacterium. A bacterium comprising a dominant negative 
allele of an MMR gene is exposed to a mutagen whereby the mutation rate 
of the bacterium is enhanced in excess of the rate in the absence of mutagen 
and in excess of the rate in the absence of the dominant negative allele. 
20 Preferably the allele is under the control of an inducible transcription 
regulatory sequence. 

Yet another aspect of the invention is a method for 
generating an MMR-proficient bacterium with a new output trait. A 
mismatch repair deficient bacterium comprising a gene of interest and a 
25 dominant negative allele of a mismatch repair gene is grown to form a 
population of mutated bacteria. The population of mutated bacteria is 
cultivated under trait selection conditions. At least one of the cultivated 
bacteria is tested to determine that the gene of interest harbors a mutation. 
Mismatch repair activity is restored to the at least one of the cultivated 
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bacteria. Preferably the allele is under the control of an inducible 
transcription regulatory sequence. 

These and other embodiments of the invention provide the art with 
methods that can generate enhanced mutability in bacteria as well as 
5 providing prokaryotic organisms harboring potentially useful mutations to 
generate novel output traits for commercial applications. The abilityto 
create hypermutable organisms using dominant negative alleles has great 
commercial value for the generation of innovative bacterial strains that 
display new output features useful for a variety of applications, including 

1 0 but not limited to the manufacturing industry for the generation of new 

biochemical s useful for detoxifying noxious chemicals from by-products of 
manufacturing processes or those used as catalysts, as well as helping in 
remediation of toxins present in the environment, including but not limited 
to polychlorobenzenes (PCBs), heavy metals and other environmental 

1 5 hazards for which there is a need to remove them from the environment. In 
addition to obtaining organisms that are useful for removal of toxins from 
the environment, novel microbes can be selected for enhanced activity to 
either produce increased quantity or quality of a protein or non-protein 
therapeutic molecule by means of biotransformation (3). 

20 Biotransformation is the enzymatic conversion, by a microbe or an extract 
derived from the microbe, of one chemical intermediate to the next product. 
There are many examples of biotransformation in use for the commercial 
manufacturing of important biological and chemical products, including 
Penicillin G, Erythromycin, and Clavulanic Acid as well as organisms that 

25 are efficient at conversion of "raw" materials to advanced intermediates 
and/or final products (Berry, A. Trends Biotechnol. 1 4(7):250-256). The 
ability to control DNA hypermutability in host bacterial strains using a 
dominant negative MMR (as described above) allows for the generation of 
variant subtypes that can be selected for new phenotypes of commercial 

30 interest, including but not limited to organisms that are toxin-resistant, have 
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the capacity to degrade a toxin in situ or the ability to convert a molecule 
from an intermediate to either an advanced intermediate or a final product. 
Other applications using dominant negative MMR genes to produce genetic 
alteration of bacterial hosts for new output traits include but are not limited 
5 to recombinant production strains that produce higher quantities of a 
recombinant polypeptide as well as the use of altered endogenous genes 
that can transform chemical or catalyze manufacturing downstream 
processes. 

This application teaches of the use of a regulatable dominant 
1 0 negative MMR phenotype to produce a prokaryotic strain with a 
commercially beneficial output trait. Using this process, microbes 
expressing a dominant negative MMR can be directly selected for the 
phenotype of interest. Once a selected bacterium with a specified output 
trait is isolated, the hypermutable activity of the dominant negative MMR 
1 5 allele can be turned-off by several methods well known to those skilled in 
the art. For example, if the dominant-negative allele is expressed by an 
inducible promoter system, including but not limited to promoters such as: 
TAC-LACI, tryp (Brosius et.al. Gene 27:161-172, 1984), araBAD 
(Guzman et.al., J. Bact. 177:4121-4130, 1995) pLex (La Vallie et.al., 
20 Bio.Technology 1 1:187-193, 1992), pRSET (Schoepfer, R. Gene 124:83- 
85, 1993) , pT7 (Studier J. Mol Biol 219(l):37-44, 1991) etc., the inducer 
is removed and the promoter activity is reduced, or a system that excises 
the MMR gene insert from the host cells harboring the expression vector 
such as the Cre-lox (Hasan, N. et.al. Gene 2:51-56, 1994), as well as 
25 methods that can homologously knockout of the expression vector. In 

addition to the recombinant methods outlined above that have the capacity 
to eliminate the MMR activity from the microbe, it has been demonstrated 
that many chemicals have the ability to "cure" microbial cells of plasmids. 
For example, chemical treatment of cells with drugs including bleomycin 
30 (Attfield et al. Antimicrob. Agents Chemother. 27:985-988, 1985) or 
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novobiocin, coumermycin, and quinolones (Fu et al. Chemotherapy 
34:41 5-41 8, 1 988) have been shown to result in microbial cells that lack 
endogenous plasmid as evidenced by Southern analysis of cured cells as 
well as sensitivity to the appropriate antibiotic (I, 41-43). Whether by use 
5 of recombinant means or treatment of cells with chemicals, removal of the 
MMR-expression plasmid results in the re-establishment of a genetically 
stable microbial cell-line. Therefore, the restoration of MMR allows host 
bacteria to function normally to repair DNA. The newly generated mutant 
bacterial strain that exhibits a novel, selected output trait is now suitable for 
1 0 a wide range of commercial processes or for gene/protein discovery to 
identify new biomolecules that are involved in generating a particular 
output trait. 

While it has been documented that MMR deficiency can lead to as 
much as a 1 000-fold increase in the endogenous DNA mutation rate of a 

1 5 host, there is no assurance that MMR deficiency alone will be sufficient to 
alter every gene within the DNA of the host bacterium to create altered 
biochemicals with new activity(s). Therefore, the use of chemical agents 
and their respective analogues such as ethidium bromide, EMS, MNNG, 
MNU, Tamoxifen, 8-Hydroxyguanine, as well as others listed but not 

20 limited to in publications by: Khroroov-Borisov, N.N., et.al. (Mutat. Res. 
430:55-74, 1999); Ohe, T., et.al. (Mutat. Res. 429:189-199, 1999); Hour, 
T.C. et.al. (Food Chem. Toxicol. 37:569-579, 1999); Hrelia, P., et.al. 
(Chem. Biol. Interact. 1 1 8:99-1 1 1, 1999); Garganta, F., et.al. (Environ. 
Mol. Mutagen. 33:75-85, 1999); Ukawa-lshikawa S., et.al. (Mutat. Res. 

25 412:99-107, 1998); www.ehs.utah.edu/ohh/mutagens , etc. can be used to 
further enhance the spectrum of mutations and increase the likelihood of 
obtaining alterations in one or more genes that can in turn generate host 
bacteria with a desired new output trait(s) (10, 39, 40). Prior art teaches 
that mismatch repair deficiency leads to hosts with an increased resistance 

30 to toxicity by chemicals with DNA damaging activity. This feature allows 
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for the creation of additional genetically diverse hosts when mismatch 
defective bacteria are exposed to such agents, which would be otherwise 
impossible due to the toxic effects of such chemical mutagens [Colella, G., 
et.al. (Br. J. Cancer 80:338-343, 1999); Moreland, N.J., et.al. (Cancer Res. 
5 59:2102-2106, 1999); Humbert, O., et.al. (Carcinogenesis 20:205-214, 
1999); Glaab, W.E., et.al. (Mutat. Res. 398:197-207, 1998)]. Moreover, 
prior art teaches that mismatch repair is responsible for repairing chemical- 
induced DNA adducts, so therefore blocking this process could 
theoretically increase the number, types, mutation rate and genomic 

10 alterations of a bacterial host [Rasmussen, L.~J. et.al. (Carcinogenesis 

17:2085-2088, 1996); Sledziewska-Gojska, E., et.al. (Mutat. Res. 383:31- 
37, 1997); and Janion, C. et.al. (Mutat. Res. 210:15-22, 1989)]. In addition 
to the chemicals listed above, other types of DNA mutagens include 
ionizing radiation and UV-irradiation, which are known to cause DNA 

1 5 mutagenesis in bacteria can also be used to potentially enhance this 

process. These agents which are extremely toxic to host cells and therefore 
result in a decrease in the actual pool size of altered bacterial cells are more 
tolerated in MMR defective hosts and in rum allow for a enriched spectrum 
and degree of genomic mutation (7). 

20 This application teaches new uses of MMR deficient bacterial cells 

to create commercially viable microbes that express novel output traits. 
Moreover, this application teaches the use of dominant negative MMR 
genes to decrease the endogenous MMR activity of the host followed by 
placing the cells under selection to obtain a desired, sought after output 

25 trait for commercial applications such as but not limited to recombinant 
manufacturing, biotransformation and bioremediation. Furthermore, the 
application teaches the use of restoring MMR activity to the hypermutable 
bacterial host following strain selection of the variant of interest as a means 
to genetically "fix" the new mutations in the host genome. The application 

30 also teaches the use of enhanced hypermutability in bacteria by using 
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MMR deficiency and chemical or radiation mutagenesis to create variant 
subtypes of bacteria useful for commercial and other applications. The 
application describes uses of hypermutable bacteria for producing strains 
that can be used to generate new output traits for chemical manufacturing, 
5 pharmaceutical and other commercially applicable processes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Western blot of steady-state PMS134 levels in IPTG-treated 
1 0 samples in DH 1 OB strain. Cells containing the pTACPMS 1 34 (lane 2) 
showed a robust steady state level of protein after induction in contrast to 
cells expressing empty vector (lane 1). Blots were probed with an anti- 
human-PMS2 polyclonal antibody. 

1 5 Figure 2. Western blot of PMS1 34V5 and PMSR3V5 in IPTG-treated (+) 
and untreated (-) samples in BL2 1 strain. Blots were probed with an anti- 
V5 antibody, which is directed to the C-terminal tag of each protein. 

Figure 3. Number of Kanamycin resistant PMS134 and vector control 
20 DH10B clones. IPTG-induced strains were grown and plated onto KAN 
plates and grown for 18 hours at 37°C to identify number of KAN resistant 
clones due to genetic alteration. 

Figure 4. Number of Kanamycin Resistant PMS134, PMSR3 and vector 
25 control BL21 clones. IPTG-induced strains were grown and plated onto 
AMP and KAN plates and grown for 1 8 hours at 37°C to identify number 
of KAN resistant clones due to genetic alteration. 

Figure 5. (A) Western blot of steady-state ATPMS134flag in IPTG-treated 
30 samples in DM OB. Lysates from untransfected cells (lane 1) and a 

-9- 
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bacterial clone expressing the Arabidopsis thaliana PMS134 truncated 
protein with a FLAG epitope fused to the C-terminus (ATPMS134flag) 
(lane 2) were electrophoresed on SDS-PAGE gels. Blots were probed with 
an anti-FLAG monoclonal antibody directed to the FLAG epitope. (B) 
5 Number of Kanamycin Resistant ATPMS 134 flag and vector control 

DH10B clones. IPTG-induced strains were grown and plated onto AMP 
and KAN plates and grown for an additional 1 8 hours at 37°C to identify 
number of KAN resistant clones due to genetic alteration. 

10 Figure 6. Generation of high recombinant producer BGAL-MOR lines in 
PMS134 expressing DH5alpha host strains. 

DETAILED DESCRIPTION OF THE INVENTION 

1 5 The inventors present a method for developing hypermutable 

bacteria by altering the activity of endogenous mismatch repair activity of 
hosts. Wild type and some dominant negative alleles of mismatch repair 
genes, when introduced and expressed in bacteria, increase the rate of 
spontaneous mutations by reducing the effectiveness of the endogenous 

20 MMR-mediated DNA repair activity, thereby rendering the bacteria highly 
susceptible to genetic alterations due to hypermutability. Hypermutable 
bacteria can then be utilized to screen for novel mutations in a gene or a set 
of genes that produce variant siblings that exhibit a new output trait(s) not 
found in the wild type cells. 

25 The process of mismatch repair, also called mismatch proofreading, 

is an evolutionarily highly conserved process that is carried out by protein 
complexes described in cells as disparate as prokaryotic cells such as 
bacteria to more complex mammalian cells ( 14, 29, 3 1 , 33, 34). A 
mismatch repair gene is a gene that encodes one of the proteins of such a 

30 mismatch repair complex. Although not wanting to be bound by any 

- 10- 
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particular theory of mechanism of action, a mismatch repair complex is 
believed to detect distortions of the DNA helix resulting from non- 
complementary pairing of nucleotide bases. The non-complementary base 
on the newer DNA strand is excised, and the excised base is replaced with 
5 the appropriate base that is complementary to the older DNA strand. In 
this way, cells eliminate many mutations that occur as a result of mistakes 
in DNA replication, resulting in genetic stability of the sibling cells derived 
from the parental cell. 

Some wild type alleles as well as dominant negative alleles cause a 

1 0 mismatch repair defective phenotype even in the presence of a wild-type 
allele in the same eel). An example of a dominant negative allele of a 
mismatch repair gene is the human gene hPMS2-134, which carries a 
truncation mutation at codon 134 (32). The mutation causes the product of 
this gene to abnormally terminate at the position of the 134th amino acid, 

15 resulting in a shortened polypeptide containing the N-terminal 133 amino 
acids. Such a mutation causes an increase in the rate of mutations, which 
accumulate in cells after DNA replication. Expression of a dominant 
negative allele of a mismatch repair gene results in impairment of 
mismatch repair activity, even in the presence of the wild-type allele. Any 

20 mismatch repair allele, which produces such effect, can be used in this 
invention. In addition, the use of over-expressed wildtype MMR gene 
alleles from human, mouse, plants, and yeast in bacteria has been shown to 
cause a dominant negative effect on the bacterial hosts MMR activity (9, 
33, 34, 38). 

25 Dominant negative alleles of a mismatch repair gene can be 

obtained from the cells of humans, animals, yeast, bacteria, plants or other 
organisms. Screening cells for defective mismatch repair activity can 
identify such alleles. Mismatch repair genes may be mutant or wild type. 
Bacterial host MMR may be mutated or not. The term bacteria used in this 

30 application include any organism from the prokaryotic kingdom. These 



BNSDOCIO. <WO 01 59092 A2. 1_> 



WO 01/59092 



PCT/US01/04339 



organisms include genera such as but not limited to Agrobacterium, 
Anaerobacter, Aquabacterium, Azorhizobium, Bacillus, Bradyrhizobium, 
Cryobacterium, Escherichia, Enterococcus, Heliobacterium, Klebsiella, 
Lactobacillus, Methanococcus, Methanothermobacter, Micrococcus, 
5 Mycobacterium, Oceanomonas, Pseudomonas, Rhizobium, 

Staphylococcus, Streptococcus, Streptomyces, Thermusaquaticus, 
Thermaerobacter, Thermobacillus, etc. Other procaryotes that can be used 
for this application are listed at ( www.bacterio.cict.fr/validgenericnames ). 
Bacteria exposed to chemical mutagens or radiation exposure can be 
1 0 screened for defective mismatch repair. Genomic DNA, cDNA, or mRNA 
from any cell encoding a mismatch repair protein can be analyzed for 
variations from the wild type sequence. Dominant negative alleles of a 
mismatch repair gene can also be created artificially, for example, by 
producing variants of the hPMS2-134 allele or other mismatch repair genes 
15 (32). Various techniques of site-directed mutagenesis can be used. The 

suitability of such alleles, whether natural or artificial, for use in generating 
hypermutable bacteria can be evaluated by testing the mismatch repair 
activity (using methods described in ref 32) caused by the allele in the 
presence of one or more wild-type alleles, to determine if it is a dominant 
20 negative allele. 

A bacterium that over-expresses a wild type mismatch repair allele 
or a dominant negative allele of a mismatch repair gene will become 
hypermutable. This means that the spontaneous mutation rate of such 
bacteria is elevated compared to bacteria without such alleles. The degree 
25 of elevation of the spontaneous mutation rate can be at least 2-fold, 5-fold, 
10-fold, 20-fold, 50-fold, 100-fold, 200-fold, 500-fold, or 1000-fold that of 
the normal bacteria as measured as a function of bacterial doubling/minute. 

According to one aspect of the invention, a polynucleotide encoding 
either a wild type or a dominant negative form of a mismatch repair protein 
30 is introduced into bacteria. The gene can be any dominant negative allele 
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encoding a protein which is part of a mismatch repair complex, for 
example, mutS, mutL, mutH, or mw/Khomologs of the bacterial, yeast, plant 
or mammalian genes (14, 28). Hie dominant negative allele can be 
naturally occurring or made in the laboratory. The polynucleotide can be in 
5 the form of genomic DNA, cDNA, RNA, or a chemically synthesized 
polynucleotide or polypeptide. The molecule can be introduced into the 
cell by transfection, transformation, conjugation, fusion, or other methods 
well described in the literature. 

Any process can be used whereby a polynucleotide or polypeptide 

10 is introduced into a cell. The process of gene "transfer can be carried out in 
a bacterial culture using a suspension culture. The bacteria can be any type 
classified under the prokaryotes. 

In general, gene transfer will be carried out using a suspension of 
cells but other methods can also be employed as long as a sufficient 

1 5 fraction of the treated cells incorporate the polynucleotide or polypeptide 
so as to allow recipient cells to be grown and utilized. The protein product 
of the polynucleotide may be transiently or stably expressed in the cell. 
Techniques for gene transfer are well known to those skilled in the art. 
Available techniques to introduce a polynucleotide or polypeptide into a 

20 prokaryote include but are not limited to electroporation, transduction, cell 
fusion, the use of chemically competent cells (e.g. calcium chloride), and 
packaging of the polynucleotide together with lipid for fusion with the cells 
of interest. Once a eel! has been transformed with the dominant negative 
mismatch repair gene or protein, the cell can be propagated and 

25 manipulated in either liquid culture or on a solid agar matrix, such as a petri 
dish. If the transfected cell is stable, the gene will be retained and 
expressed at a consistent level when the promoter is constitutively active, 
or when in the presence of appropriate inducer molecules when the 
promoter is inducible, for many cell generations, and a stable, 

30 hypermutable bacterial strain results. 

- 13- 
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An isolated bacteria] cell is a clone obtained from a pool of a 
bacterial culture by chemically selecting out non-recipient strains using, for 
example, antibiotic selection of an expression vector. If the bacterial cell is 
derived from a single cell, it is defined as a clone. 
5 A polynucleotide encoding a dominant negative form of a mismatch 

repair protein can be introduced into the genome of a bacterium or 
propagated on an extra-chromosomal plasmid. Selection of clones 
harboring the mismatch repair gene expression vector can be accomplished 
by addition of any of several different antibiotics, including but not limited 
10 to ampicillin, kanamycin, chloramphenicol, zeocin, and tetracycline. The 
microbe can be any species for which suitable techniques are available to 
produce transgenic microorganisms, such as but not limited to genera 
including Bacillus, Pseudomonas, Staphylococcus, Escherichia and others. 
Any method for making transgenic bacteria known in the art can be 
1 5 used. According to one process of producing a transgenic microorganism, 
the polynucleotide is transfected into the microbe by one of the methods 
well known to those in the art. Next, the microbial culture is grown under 
conditions that select for cells in which the polynucleotide encoding the 
mismatch repair gene is either incorporated into the host genome as a stable 
20 entity or propagated on a self-replicating extra-chromosomal plasmid, and 
the protein encoded by the polynucleotide fragment transcribed and 
subsequently translated into a functional protein within the cell. Once 
transgenic microbe is engineered to harbor the expression construct, it is 
then propagated to generate and sustain a culture of transgenic microbes 
25 indefinitely. 

Once a stable, transgenic microorganism has been engineered to 
express a functional mismatch repair (MMR) protein, the microbe can be 
exploited to create novel mutations in one or more target gene(s) of interest 
harbored within the same microorganism. A gene of interest can be any 
30 gene naturally possessed by the bacterium or one introduced into the 
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bacterial host by standard recombinant DNA techniques. The target 
gene(s) may be known prior to the selection or unknown. One advantage 
of employing such transgenic microbes to induce mutations in resident or 
extra-chromosomal genes within the microbe is that it is unnecessary to 
5 expose the microorganism to mutagenic insult, whether it be chemical or 
radiation in nature, to produce a series of random gene alterations in the 
target gene(s). This is due to the highly efficient nature and the spectrum 
of naturally occurring mutations that result as a consequence of the altered 
mismatch repair process. However, it is possible to increase the spectrum 

10 and frequency of mutations by the concomitant use of either chemicals 
and/or radiation together with MMR defective cells. These include DNA 
mutagens, DNA alkylating agents, DNA intercalating agents, DNA 
oxidizing agents, ionizing raditation, and ultraviolet radiation. The net 
effect of the combination treatment is the increase in altered gene pool in 

1 5 the genetically altered microbe that result in an increased alteration of an 
allele(s) that are useful for producing new output traits. Another benefit of 
using MMR-defective microbes that are taught in this application is that 
one can perform a genetic screen for the direct selection of variant sub- 
clones that exhibit new output traits with commercially important 

20 applications. This allows one to bypass tedious and time consuming gene 
identification, isolation and characterization. 

Mutations can be detected by analyzing the recombinant microbe 
for alterations in the genotype and/or phenotype post-activation of the 
decreased mismatch repair activity of the transgenic microorganism. Novel 

25 genes that produce altered phenotypes in MMR-defective microbial cells 
can be discerned by any variety of molecular techniques well known to 
those in the art. For example, the microbial genome can be isolated and a 
library of restriction fragments cloned into a plasmid vector. The library 
can be introduced into a "normal" cell and the cells exhibiting the novel 

30 phenotype screened. A plasmid is isolated from those normal cells that 

- 15- 



BNSDOCID: <WO 0159092A2J_> 



WO 01/59092 



PCT/US01/04339 



exhibit the novel phenotype and the gene(s) characterized by DNA 
sequence analysis. Alternatively, differential messenger RNA screen can 
be employed utilizing driver and tester RNA (derived from wild type and 
novel mutant respectively) followed by cloning the differentia] transcripts 
5 and characterizing them by standard molecular biology methods well 
known to those skilled in the art. Furthermore, if the mutant sought is "on 
encoded by an extrachromosmal plasmid, then following co-expression of 
the dominant negative MMR gene and the gene of interest to be altered and 
phenotypic selection, the plasmid is isolated from mutant clones and 
10 analyzed by DNA sequence analysis by methods well known to those in the 
art. Phenotypic screening for output traits in MMR-defective mutants can 
be by biochemical activity and/or a physical phenotype of the altered gene 
product. A mutant phenotype can also be detected by identifying 
alterations in electrophoretic mobility, DNA binding in the case of 
1 5 transcription factors, spectroscopic properties such as IR, CD, X-ray 

crystallography or high field NMR analysis, or other physical or structural 
characteristics of a protein encoded by a mutant gene. It is also possible to 
screen for altered novel function of a protein in situ, in isolated form, or in 
model systems. One can screen for alteration of any property of the 
20 microorganism associated with the function of the gene of interest, whether 
the gene is known prior to the selection or unknown. The aforementioned 
screening and selection discussion is meant to illustrate the potential means 
of obtaining novel mutants with commercially valuable output traits. 

Plasmid expression vectors that harbor the mismatch repair (MMR) 
25 gene inserts can be used in combination with a number of commercially 

available regulatory sequences to control both the temporal and quantitative 
biochemical expression level of the dominant negative MMR protein. The 
regulatory sequences can be comprised of a promoter, enhancer or 
promoter/enhancer combination and can be inserted either upstream or 
30 downstream of the MMR gene to control the expression level. The 
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regulatory promoter sequence can be any of those well known to those in 
the art, including but not limited to the lacJ, tetracycline, tryptophan- 
inducible, phosphate inducible, T7-polymerase-inducible (30), and steroid 
inducible constructs as well as sequences which can result in the excision 
5 of the dominant negative mismatch repair gene such as those of the Cre- 
Lox system. These types of regulatory systems are familiar to those skilled 
in the art. 

Once a microorganism with a novel, desired output trait of interest 
is created, the activity of the aberrant MMR activity can beattenuated or 

1 0 eliminated by any of a variety of methods, including removal of the inducer 
from the culture medium that is responsible for promoter activation, gene 
disruption of the aberrant MMR gene constructs, electroporation and/or 
chemical curing of the expression plasmids (Brosius, Biotechnology 
10:205-225,1988; Wang et al., J. of Fujian Agricultural University 28:43- 

15 46,1999; Fu et. al., Chem Abstracts 34:415-418, 1988). The resulting 
microbe is now useful as a stable strain that can be applied to various 
commercial applications, depending upon the selection process placed upon 
it. 

In cases where genetically deficient mismatch repair bacteria 
20 [strains such as but not limited to: M 1 (mutS) and in EC24 1 6 (mutS delta 
umuDC), and mutL or mutY strains] are used to derive new output traits, 
transgenic constructs can be used that express wild-type mismatch repair 
genes sufficient to complement the genetic defect and therefore restore 
mismatch repair activity of the host after trait selection [Grzesiuk, E. et.al. 
25 (Mutagenesis 13:127-132, 1998); Bridges, B.A., et.al. (EMBO J. 16:3349- 
3356, 1997); LeClerc, J.E., Science 15:1208-1211, 1996); Jaworski, A. 
et.al. (Proc. Natl. Acad. Sci USA 92:1 1019-11023, 1995)]. The resulting 
microbe is genetically stable and can be applied to various commercial 
practices. 
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The use of over expressing foreign mismatch repair genes from 
human and yeast such as PMS1, MSH2, MLHI, MLH3, etc. have been 
previously demonstrated to produce a dominant negative mutator 
phenotype in bacterial hosts (35, 36, 37). In addition, the use of bacterial 
5 strains expressing prokaryotic dominant negative MMR genes as well as 
hosts that have genomic defects in endogenous MMR proteins have also 
been previously shown to result in a dominant negative mutator phenotype 
(29,32). However, the findings disclosed here teach the use of MMR 
genes, including the human PMSR2 and PMSR3 gene (ref 19), the related 

1 0 PMS 1 34 truncated MMR gene (ref 32), the plant mismatch repair genes 
(SARAH include Plant patent application) and those genes that are 
homologous to the 1 34 N-terminal amino acids of the PMS2 gene which 
include the MutL family of MMR proteins and including the PMSR and 
PMS2L homologs desribed by Hon et.al. (accession number NM 005394 

1 5 and NM 005395) and Nicolaides (reference 1 9) to create hypermutable 

microbes. In addition, this application teaches the use of DNA mutagens in 
combination with MMR defective microbial hosts to enhance the 
hypermutable production of genetic alterations. This accentuates MMR 
activity for generation of microorganisms with commercially relevant 

20 output traits such as but not limited to recombinant protein production 
strains, biotransformation, and bioremediation. 

The above disclosure generally describes the present invention. A 
more complete understanding can be obtained by reference to the following 
specific examples that will be provided herein for purposes of illustration 

25 only, and are not intended to limit the scope of the invention 
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EXAMPLES 

Example 1: Generation of inducible MMR dominant negative allele 
vectors 

Bacterial expression constructs were prepared to determine if the human 
5 PMS2 related gene (hPMSR3) ( 1 9) and the human PMS 1 34 gene (32) are 
capable of inactivating the bacterial MMR activity and thereby increase the 
overall frequency of genomic hypermutation, a consequence of which is the 
generation of variant sib cells with novel output traits following host 
selection. Moreover, the use of regulatable expression vectors will allow 

10 for suppression of dominant negative MMR alleles and restoration of the 
MMR pathway and genetic stability in hosts cells (43). For these studies, 
a plasmid encoding the hPMSl 34 cDNA was altered by polymerase chain 
reaction (PCR). The 5' oligonucleotide has the following structure: 5'- 
ACG CAT ATG GAG CGA GCT GAG AGC TCG AGT-3' that includes 

1 5 the Ndel restriction site CAT ATG. The 3'-oligonucleotide has the 

following structure: 5'-GAA TTC TTA TCA CGT AGA ATC GAG ACC 
GAG GAG AGG GTT AGG GAT AGG CTT ACC AGT TCC AAC CTT 
CGC CGA TGC-3' that includes an EcoRJ site GAA TTC and the 14 
amino acid epitope for the V5 antibody. The oligonucleotides were used 

20 for PCR under standard conditions that included 25 cycles of PCR (95°C 
for 1 minute, 55°C for 1 minute, 72°C for 1.5 minutes for 25 cycles 
followed by 3 minutes at 72°C). The PCR fragment was purified by gel 
electrophoresis and cloned into pTA2.1 (InVitrogen) by standard cloning 
methods (Sambrook et al., Molecular Cloning: A Laboratory Manual, Third 

25 Edition, 2001), creating the plasmid pTA2.1-hPMS134. pTA2.1-hPMS134 
was digested with the restriction enzyme EcoRI to release the insert (there 
are two EcoRI restriction sites in the multiple cloning site of pTA2. 1 that 
flank the insert) and the fragment filled in with Klenow fragment and 
dNTPs. Next, the fragment was gel purified, then digested with Ndel and 

30 inserted in pT7-Ea that had been digested with Ndel and BamHl (filled 
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with Klenow) and phosphatase treated. The new plasmid was designated 
pT7-Ea-hPMS134. The following strategy, similar to that described above 
to clone human PMS1 34, was used to construct an expression vector for the 
human related gene PMSR3. First, the hPMSR3 fragment was amplified 
5 by PCR to introduce two restriction sites, an Ndel restriction site at the 5'- 
end and an Eco RJ site at the 3 '-end of the fragment. The 5'- 
oligonucleotide that was used for PCR has the following structure: 5'-ACG 
CAT ATG TGT CCT TGG CGG CCT AGA-3' that includes the Ndel 
restriction site CAT ATG. The 3 '-oligonucleotide used for PCR has the 
1 0 following structure: 5'-GAA TTC TTA TTA CGT AGA ATC GAG ACC 
GAG GAG AGG GTT AGG GAT AGG CTT ACC CAT GTG TGA TGT 
TTC AGA GCT-3' that includes an EcoRI site GAA TTC and the V5 
epitope to allow for antibody detection. The plasmid that contained human 
PMSR3 in pBluescript SK (19) was used as the PCR target with the 

] 5 hPMS2-specific oligonucleotides above. Following 25 cycles of PCR 
(95°C for 1 minute, 55°C for 1 minute, 72°C for 1 .5 minutes for 25 cycles 
followed by 3 minutes at 72°C). The PCR fragment was purified by gel 
electrophoresis and cloned into pTA2. 1 (InVitrogen) by standard cloning 
methods (Sambrook et al., Molecular Cloning: A Laboratory Manual, Third 

20 Edition, 2001 ), creating the plasmid pTA2. 1 -hR3. pTA2. 1 -hR3 was next 
digested with the restriction enzyme EcoRJ to release the insert (there are 
two EcoRI restriction sites in the multiple cloning site of pTA2.1 that flank 
the insert) and the fragment filled in with Klenow fragment and dNTPs. 
Then, the fragment was gel purified, then digested with Ndel and inserted 

25 in pT7-Ea that had been digested with Ndel and BamHI (filled with 

Klenow) and phosphatase treated. The new plasmid was designated pT7- 
Ea-hR3. 

BL2I cells harbor an additional expression vector for the lysozyme 
protein, which has been demonstrated to bind to the T7 polymerase in situ; 
30 this results in a bacterial strain that has very low levels of T7 polymerase 
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expression. However, upon addition of ihe inducer IPTG, the cells express 
high-levels of T7 polymerase due to the IPTG-inducible element that drives 
expression of the polymerase that is resident within the genome of the 
BL21 cells (30). The BL2 1 cells are chloramphenicol resistant due to the 
5 plasmid that expresses lysozyme within the cell. To introduce the pT7- 
hPMS134 or the pT7-hPMSR3 genes into BL21 cells, the cells were made 
competent by incubating the cells in ice cold 50mM CaCb for 20 minutes, 
followed by concentrating the cells and adding super-coiled plasmid DNA 
as desecribe (Maniatis, T.et.al. Cold Spring Harbor Laboratory Press, Third 

10 Edition, 2001). Ampicillin resistant BL21 were selected on LB-agar plates 
[5% yeast extract, 10% bactotryptone, 5% NaCl, 1.5% bactoagar , pH 7.0 
(Difco)] plates containing 25 ug/ml chloramphenicol and 1 OOug/ml 
ampicillin. The next day, bacterial colonies were picked and analyzed for 
vectors containing an intact pTACPMS 1 34 or pTAC empty vector by 

1 5 restriction endonuclease digestion and sequence analysis. 

In addition to constructing a V5-epitope tagged PMS 1 34 construct 
we also constructed and tested a non-epitope tagged version. This was 
prepared to demonstrate that the simple fact of epitope tagging the 
construct did not result in alteration of the dominant-negative phenotype 

20 that PMS 1 34 has on mismatch repair activity. For these studies, a BamHI 
restriction fragment containing the hPMS134 cDNA was filled-in with 
Klenow fragment and then sub-cloned into a Klenow-filled blunt-ended 
Ndel-Xhol site of the pTACLAC expression vector, which contains the 
isopropylthio-P-galactosidase (IPTG)-inducible bacterial TAC promoter 

25 and ampicillin resistance gene as selectable marker. The Ndel-Xhol 
cloning site is flanked by the TACLAC promoter that contains the Lad 
repressor site followed by a Shine Dalgarno ribosome-binding site at the 5' 
flanking region and the T1T2 ribosomal RNA terminator in the 3' flanking 
region. The TACLAC vector also contains the LacI gene, which is 

30 constitutively expressed by the TAC promoter. 
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DH10B bacteria] cells containing the pBCSK vector (Stratagene), 
which constitutively expresses the P-galactosidase gene and contains the 
chloramphenicol resistance marker for selection, were made competent via 
the CaCl 2 method (Maniatis, T.et.al. Cold Spring Harbor Laboratory Press, 
5 1 982). This vector turns bacteria] cells blue when grown in the presence of 
IPTG and X-gal that aids in the detection of bacterial colonies. Competent 
cells were transfected with the pTAC empty vector or the pTACPMSJ34 
vector following the heat-shock protocol. Transfected cultures were plated 
onto LB-agar [5% yeast extract, 1 0% bactotryptone, 5% NaCl, 1.5% 
10 bactoagar , pH 7.0 (Difco)] plates containing 25 pg/ml chloramphenicol 
and lOOug/ml ampicillin. The next day, bacterial colonies were picked 
and analyzed for vectors containing an intact pTACPMS134 or pTAC 
empty vector by restriction endonuclease digestion and sequence analysis. 
Ten clones of each bacteria containing correct empty or PMS134 inserts 
1 5 were then grown to confluence overnight in LB media (5% yeast extract, 
1 0% bactotryptone, 5% NaCl, pH 7.0) containing 1 0 fig/ml 
chloramphenicol and 50 ug/ml ampicillin. The next day TAC empty or 
pTACPMS134 cultures were diluted 1:4 in LB medium plus 50uM IPTG 
(Gold Biotechnology) and cultures were grown for 1 2 and 24 hours at 
20 37°C. After incubation, 50ul aliquots were taken from each culture and 
added to 150uls of 2X SDS buffer and cultures were analyzed for PMS134 
protein expression by western blot. 

Western blots were carried out as follows. 50^ils of each PMS 1 34 
or empty vector culture was directly lysed in 2X lysis buffer (60 mM Tris, 
25 pH 6.8, 2% SDS, 1 0% glycerol, 0.1 M 2-mercaptoethanol, 0.001% 

bromophenol blue) and samples were boiled for 5 minutes. Lysate proteins 
were separated by electrophoresis on 4-20% Tris glycine gels (Novex). 
Gels were electroblotted onto Immobilon-P (Millipore) in 48 mM Tris 
base, 40 mM glycine, 0.0375% SDS, 20% methanol and blocked overnight 
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at 4°C in Tris-buffered saline plus 0.05% Tween-20 and 5% condensed 
milk. Filters were probed with a rabbit polyclonal antibody generated 
against the N-terminus of the human PMS2 polypeptide (Santa Cruz), 
which is able to recognize the PMS134 polypeptide (31), followed by a 
5 secondary goat anti-rabbit horseradish peroxidase-conjugated antibody. 
After incubation with the secondary antibody, blots are developed using 
chemiluminescence (Pierce) and exposed to film to measure PMS134 
expression. 

As shown in Figure 1, a robust expression of PMS134 could be 
1 0 detected in bacterial cells containing pTACPMS 1 34 (lane 2) in contrast to 
cells expressing empty vector (lane 1 ), which had no signal. 

For induction of PMS134 and PMSR3 in BL21 cells, the pT7-Ea- 
hPMS134 or the pT7-Ea-hPMSR3 cells were induced with 50uM IPTG for 
12 and 24 hours. Cell lysates were prepared and analyzed by western blot 
1 5 listed above using either the N-termina! PMS2 antibody to detect the 

PMS134 containing cells or the antiV5-horseradish peroxidase conjugated 
monoclonal antibody (InVitrogen) to detect the PMS134V5 and PMSR3V5 
polypeptides. Figure 2 shows the expression of PMS134V5 and PMSR3V5 
before (-) lanes and after IPTG (+) lanes induction. 

20 

Example 2: Generation of hypermutable bacteria with inducible 
dominant negative alleles of mismatch repair genes 

Bacterial clones expressing the PMS134 or the empty vector were 

25 grown in liquid culture for 24 hr at 37°C in the presence of 1 0 ug/ml 

chloramphenicol and 50 ug/ml ampicillin plus 50uM IPTG. The next day, 

cultures were diluted 1:10 in medium containing 50uM IPTG plus 

ampicillin/chloramphenicol (AC) or ampicillin/chloramphenicol plus 25 

pg/ml kanamycin (ACK) and cultures were grown for 1 8 hr at 37°C. The 

30 following day, a 0.1 pi aliquot (2 pi diluted in 1000 pi of LB medium and 
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used 50 pi for plating) of cells grown in AC medium were plated on LB- 
agar plates containing 40 pg/ml of 5-bromo-4-chloro-3-indolyl-B-D- 
galactoside (X-gal) plus ]00pg/ml ampicillin (AMP), while a 1 pi aliquot 
(1 pi diluted in 100 pi of LB medium and used 100 pi for plating) of cells 
5 grown in ACK medium were plated on LB-agar plates containing X-gal 
and 50pg/m! kanamycin (KAN). Plates were incubated for 1 8 hours at 
37°C. The results from these studies show that cells expressing the 
PMS134 were able to increase hypermutation in the genome of the DH 1 0B 
bacterial strain which resulted in the production of siblings that exhibit new 
1 0 biological traits such as KAN resistance (Figure 3). 

Kanamycin-resistant assays using BL21 cells expressing the 
V5-tagged or untagged PMS134 or PMSR3 polypeptides were 
carried out as described above. BL2 1 bacterial cells that harbor the 
empty vector, pT7-PMS134 or pT7-PMSR3 were grown overnight in 
15 LB supplemented with lOOug/ml ampicillin. The overnight cultures 
were diluted 1 :100 into fresh ampicillin containing medium and 
grown for 2.5 hours at 37°C with continuous shaking. When the cells 
reached an optical density (OD) of 0.6, measured at 600nm, IPTG 
was added to each culture to a final concentration of 0.5mM. Cells 

20 were incubated for 24, and 48 hours; at those time points cells were 
removed for SDS-PAGE analysis and plating (see above). 
BL21/pT7 (empty vector), BL21/pT7-PMS134, and BL21/pT7-R3 
cells were plated onto LB plates, LB plates that contained lOOug/ml 
ampicillin, and plates that contain 50ug/ml Kanamycin. The 

25 equivalent of 1 x 10 7 cells/plate were spread onto the plates. BL21 
cells that harbor the empty vector are capable of growth on LB plates 
as well as LB plates that contain lOOug/ml ampicillin; that is as 
expected since the pT7 expression vector renders the cells ampicillin 
resistant. The vector only control is not capable of growth on 

30 Kanamycin. After 24hr IPTG-induction PMSl 34 or PMSR3 cells 
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had a significant number of KAN resistant cells while none were 
observed in BL21 parental cells grown under similar conditions 
(Figure 4). Moreover, BL2 1 cells containing the PMS134 or PMSR3 
genes under non-lPTG-induced conditions failed to produce any 
5 KAN resistant clones demonstrating the need for expression of the 
PMS polypeptides for hypermutability. A summary outlining the 
data 

and number of Kanamycin resistant bacterial clones is provided in 
TABLE 1. 



TABLE 1. Generation of Kanamycin resistant clones via MMR deficiency 



STRAIN 


# CELLS 
SEEDED 


AMP R 
colonies 


kanR 

colonies 


FREQUENCY 


DH10B VEC 


50,000 


62,000 


0 


0 


DH10B 
PMS134 


50,000 


43,146 


23 


5.3 x JO" 4 


BL21 VEC 


500,000 


520,80 
0 


0 


0 


BL2I T7-Ea- 
PMS134V5 


500,000 


450,00 
0 


2,245 


4.9 x 10" 3 


BL2I T7-Ea- 
PMSR3V5 


500,000 


500,00 
0 


1,535 


3.1 x lO- 3 



These data demonstrate and enable the proof-of-concept that the use of the 
dominant negative MMR genes is a viable approach to creating 
hypermutable bacteria that can lead to the generation of phenotypically 
1 5 diverse offspring when put under selective conditions. 

Using the same protocol as listed above and the same cloning 
strategy, a truncated PMS2 homolog from the Arabidopsis thaliana 
plant (reference the PLANT patent), which was cloned by 
degenerate PCR from an Arabidopsis thaliana cDNA library 
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(Strategene), was found to give a similar enhancement of genetic 
hypermutabiliry in DH5alpha bacteria Figure 5. For detection 
purposes, we fused a FLAG epitope to the C-terminus of the PMS134 
polypeptide using PCR and an antisense primer directed to the 134 
5 codon region of the Arabidopsis PMS2 homolog followed by a 
FLAG epitope and 2 termination codons. The resultant fusion was 
termed ATPMS 1 34-flag. The AT PMS1 34-flag gene was then 
cloned into the IPTG-inducible TACLAC expression vector and 
transfected into DH5aIpha cells. Western blot of bacteria transfected 
10 with an IPTG-inducible expression vector carrying a truncated 

version (codons 1-134) of the Arabidopsis thaliana PMS2 homolog 
using the anti-FLAG antibody demonstrated the inducibility and 
steady-state protein levels of the chimeric gene. Figure 5A shows 
the western blot containing protein from an untransfected cell (lane 
15 1) and a bacterial clone expressing the Arabidopsis PMS2-134 
truncated protein (lane 2). Following the mutagenesis protocol 
described above, bacterial cells expressing the ATPMS 134 protein 
were found to have an increase in the number of KAN resistant cells 
(12 clones) in contrast to cells expressing the empty vector that 
20 yielded no KAN resistant clone. 

Bacterial cells such as the pT7-PMSI34 and pT7-R3 
harboring BL21 cells; the TACLACPMS 1 34 DH 1 OB; the 
TACLACMLH 1 DH10B cells; or the TACLAC-ATPMS134flag 
DH5alpha cells are capable of growth on LB, LB/ampicillin and 
25 LB/KAN plates because the cells have acquired mutations within 
their genome that render the cell drug resistant. Cells that express 
dominant negative MMR genes have altered the mismatch control 
pathway of the microbe, presumably altering a gene or a set of genes 
that control resistance to kanamycin. A new output trait, Kanamycin- 
30 resistance, is generated by expression of the dominant negative MMR 
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gene in these cells. These data demonstrate the ability of dominant 
negative MMR genes to produce hypermutability across a wide array 
of bacterial strains to produce new output traits such as Kanamycin 
resistance. 

5 

EXAMPLE 3: Dominant negative MMR genes can produce new 
genetic variants and commercially viable output traits in 
prokaryotic organisms. 

1 0 The data presented in EXAMPLE 2 show the ability to 

generate genetic alterations and new phenotypes in bacterial strains 
expressing dominant negative MMR genes. In this EXAMPLE we 
teach the utility of this method to create prokaryotic strains with 
commercially relevant output traits. 

15 

GENERATION OF HEAT-RESISTANT PRODUCER 
STRAINS. 

One example of commercial utility is the generation of heat- 
resistant recombinant protein producer strains. In the scalable 

20 process of recombinant manufacturing, large-scale fermentation of 
prokaryotes results in the generation of heat, which leads to 
suboptimal growth conditions for the producer strain and thus 
resulting in lower recombinant protein yields. In order to circumvent 
this problem, we employed the use of DH10B bacteria containing the 

25 inducible TACLACPMS1 34 gene. Briefly, cells were grown in 5 ml 
LB shake flasks containing ampicillin and IPTG-induced for 0, 24 
and 48hrs at 37C. Cultures were harvested and then incubated at 
100C for 0, 1 or 10 minutes (times at which 100% of the wild-type 
strain perishes) and 1 00 pi aliquots (equivalent to 250,000 cells) were 

30 plated onto LB agar plates containing ampicillin to identify heat 
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resistant clones. Table 2 shows a typical experiment whereby cells 
containing the TACLACPMS 1 34 gene generated a significant 
number of heat-resistant clones after 48 hours of PMS134 induction 
and hypermutation via MMR blockade. No or a few clones were 
5 observed in the uninduced or 24hr induced conditions respectively 
suggesting the needs for multiple rounds of genetic mutation to 
produce genes that are capable of allowing bacteria to survive under 
harsh conditions. Similar results were observed with other dominant 
negative mutants such as the PMSR2, PMSR3, and the human MLH 1 
1 0 proteins (not shown). 



TABLE 2. Generation of heat-resistant clones via MMR deficiency 



Treatment 


Heated 


Heated 


Heated 




0 min 


1 min 


10 min 


TACLACVEC 


250,000 +/- 


0 


0 


0 hr IPTG 


7,500 






TACLACPMS 134 


265,000 +/- 


0 


0 


0 hr IPTG 


2,000 






TACLACVEC 


274,000 +/- 


1 +/-0 


0 


24 hr IPTG 


12,000 






TACLACPMS 1 34 


240,000 +/- 


5+/- 2 


0 


24 hr IPTG 


9,400 






TACLACVEC 


256,000 +/- 


0 


0 


48 hr IPTG 


12,000 






TACLACPMS 1 34 


252,000 +/- 


65 +/- 8 


3+/- 1 


48 hr IPTG 


14,000 







15 GENERATION OF HIGH RECOMBINANT PROTEIN 
PRODUCER STRAINS. 

Next, we tested the ability of bacteria expressing dominant 
negative MMR genes to produce subl clones with enhanced 
20 recombinant protein production. In these experiments again we 
employed the DH10B cells containing the TACLACPMS 134 
inducible vector plus the pTLACZ vector, which constitutively 
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expresses the fS-galactosidase gene. Analysis of individual clones 
containing the TACLACPMS 1 34 and pTLACZ vector typically 
produces 10-20pg/ml of LACZ protein via shake flask fermentation 
after 1PTG induction for 24 hours. To test the hypothesis that high 
5 recombinant producer strains can be generated by decreased MMR in 
bacterial strains, we induced the TACLACPMS 1 34-pTLACZ cells 
for 48 hours with IPTG as described above. We then diluted the 
culture 1 :50 in LB medium, grew the strain for 24 hours, and plated 
10 pis of culture (diluted in 300 pis of LB) onto LB amp-XGAL 

1 0 plates to identify candidate clones that produce robust levels of 
recombinant LACZ protein. As a control, uninduced cells were 
treated similarly and plated onto LB amp-XGAL plates. Analysis of 
the plates revealed a number of bacterial colonies exhibiting a 
number of clones with an intense BLUE staining in the 

1 5 TACLACPMS 1 34/pTLACZ cells induced with IPTG but none were 
observed in uninduced clones (Figure 6). To confirm that these 
clones produced an enhanced level of LACZ, we expanded 2 clones 
with an average BLUE stain (BGAL-C1 and BGAL-C2) and 10 
clones with a robust BLUE staining (BGAL-MOR1 to BGAL- 

20 MORI 0). We grew all clones in LB AMP for 24 hours without IPTG 
and replated the clones. Six out of ten BGAL-MOR clones resulted 
in a more robust (3-gal stain in situ as compared to control "average" 
cells (BGALC1 and C2). We next performed a more quantitative 
assay using a p*-gal ELISA assay. Briefly, 2 mis of cell centrifuged 

25 at 10,000gs for 10 minutes and resuspended in 0.5 mis of 0.25M Tris, 
pH 7.5 plus 0.0001% Tween-20. Cells were freeze-thawed 4X's and 
vortexed for 4 minutes at room temperature. Lysates were cleared of 
debris by centrifugation and supematants were collected. Protein 
extracts were quantified for total protein using the Bradford assay 

30 (BioRad) as described by the manufacturer. Plate ELlSAs were 
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carried out by coating 96 well maxisorb (NUNC) plates with 0. 1 mis 
of a 1 Mg/ml (diluted in PBS pH7.0) bacterial extract solution and a 
dose range of recombinant p-GAL (Sigma) from 0.001 to 10 mg/ml. 
All samples were plated in triplicates. Plates were coated for 2 hours, 
5 washed 2 times with PBS and blocked with 0.2mls of PBS plus 5% 
powdered milk for 30 minutes. Next, plates were washed once with " 
PBS and incubated with an anti-p-galactosidase monoclonal antibody 
that recognizes both native and denatured forms (Sigma) for 2 hours. 
Plates were then washed 3 times with PBS and incubated with 0.1 
10 mis of an anti-mouse horseradish peroxidase conjugated antibody for 
1 hour at room temperature. Plates were washed 3 times with PBS 
and incubated with TMB ELISA substrate (BioRad) for 15 to 30 
minutes. Reactions were stopped with 0.1N H 2 S0 4 and read on a 
BioRAD plate reader at 41 5nm. The control clones produced 
1 5 roughly 9 and 13 ugs /ml of p-gal while BGAL-MOR clones 2, 3 and 
9 produced 106, 82 and 143 u.gs/ml of p-gal. To determine if reason 
that these clones produced more P-gal was due to mutations in the 
plasmid promoter elements, we isolated the pTLACZ plasmid and 
retransfected it into DH1 0B cells as described above. In situ analysis 
20 found the resultant clones to produce similar amount of P-gal as that 
of the control. These data suggest that the BGAL-MOR 2, 3, and 9 
hosts had alterations, which results in elevated expression and/or 
stability of recombinant proteins. 

To determine if the enhanced in situ p-gal expression that was 
25 observed in BGAL-MOR clones 1 , 5, and 6, which did not appear to 
have enhanced p-gal protein levels (had less than 15 u.g/ml as 
determined by ELISA) was authentic, we performed a more 
quantitative assay on these lines plus the BGAL-MOR 9, the 
BGALC1 and C2 lines as control. Cells containing an empty vector 
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(without a LACZ gene) were used as negative control. To measure 
P-gal activity, we employed a colorimetric p-gal substrate assay 
using CPRG (Roche) as described (3 1 ). Briefly, 5 jigs of protein 
extract isolated for EL1SA analysis (described above) were analyzed 
5 using a plate assay. Protein was added to buffer containing 45 mM 
2-mercaptoethanol, ImM MgCl 2 , 0.1 M NaP0 4 and 0.6 mg/ml 
Chlorophenol red-P-D-galactopyranoside (CPRG, Roche). Reactions 
were incubated for I hour, terminated by the addition of 0.5 M 
Na 2 C03, and analyzed by spectrophotometry at 576 nm in a BioRad 
10 plate reader. Analysis of these extracts confirmed our in situ data 
that these cells did have increased f3-ga! activity (TABLE 3). 



TABLE 3. Generation of bacterial clones with increased p-gal enzymatic activity via 
M MR deficiency. 



Clone 


B-gal protein 
(ug/ml) 


P-gal activity (O.D. 576) 


BGAL-C1 


9 


0.413 +/-.092 


BGAL-C2 


13 


0.393 +/-.105 


BGAL-MOR1 


14 


0.899 +/- .134 


BGAL-MOR5 


13 


0.952 +/- .133 


BGAL-MOR6 


15 


0.923 +/- .100 


BGAL-MOR9 


143 


0.987 +/- .106 


Empty vector 




0.132 +/- .036 



15 

Because there was no observable increase in the amount of p- 
gal protein one likely hypothesis is that the p-gal gene structure was 
mutated during the hypermutability growth stage and now produces a 
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more active enzyme. Sequence analysis confirms that this may be 
the reason for enhanced activity in a subset of clones. 

Together, these data demonstrate the ability to produce 
genetically altered prokaryotic host strains using dominant negative 
5 MMR genes to generate commercially valuable output traits such as 
high recombinant protein producer lines and structurally altered 
enzymes with enhanced activities. 

EXAMPLE 4: Mutations in the host genome generated by 
10 defective MMR are genetically stable 

As described in EXAMPLE 2 and 3, manipulation of the 
MMR pathway in microbes results in alterations within the host 
genome and the ability to select for a novel output traits. It is 
1 5 important that the mutations introduced as a result of defective MMR 
is genetically stable and passed on to daughter cells once a desired 
output pathway is established. To determine the genetic stability of 
mutations introduced into the microbial genome the following 
experiment was performed. Five independent colonies from pT7- 

20 PMS1 34 and pT7-PMSR3 that are kanamycin resistance were grown 
overnight from an isolated colony in 5 ml of LB. Next, 1 uL of the 
overnight culture from these cultures were inoculated into another 5 
mL of LB and grown overnight to saturation. Under these growth 
conditions the microbial cells have divided over 20 generations. 

25 Therefore, if the new output trait generated by alteration of MMR is 
unstable, the cells should "revert" back from kanamycin resistance to 
kanamycin sensitivity. Cells were plated onto LB plates and 
incubated overnight at 37°C. Next, the colonies (about 1,000/plate) 
were replica plated to LB, LB an,p,0 °, and LB 1 "*" 50 plates and incubated 

30 at 37°C overnight. Analysis of clones from these studies reveal that a 
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strict correlation occurs with loss of dominant negative MMR 
expression and phenotype stability. No loss of KAN resistant clones 
generated in Example 3 were observed when cells were grown in the 
absence of IPTG (not expressing PMS134), while 5 revertants out of 
5 1 200 were observed in clones the were continually grown in IPTG 
(express PM SI 34). Extended culturing of cells and replica plating 
found no reversions of KAN resistance in cultures grown in the 
absence of IPTG, which produce no PMS134 as determined by 
western blot (data not shown). 

1 0 These data demonstrate the utility of employing inducible 

expression systems and dominant negative MMR genes in 
prokaryotes to generate genetically altered strains for commercial 
applications such as but not limited to enhanced recombinant 
manufacturing and biotransformation that can then in rum be restored 

15 to a genetically stable host with a "fixed" new genotype that is 
suitable for commercial processes. 

EXAMPLE 5: Enhanced Generation of MMR-Defective Bacteria 
and Chemical Mutagens for the Generation of New Output 
20 Traits 

It has been previously documented that MMR deficiency yields to 
increased mutation frequency and increased resistance to toxic effects of 
chemical mutagens (CM) and their respective analogues such as but not 

25 limited to those as: ethidium bromide, EMS, MNNG, MNU, Tamoxifen, 8- 
Hydroxyguanine, as well as others listed but not limited to in publications 
by: Khromov-Borisov.N.K, et.al. (Mutat. Res. 430:55-74, 1999); Ohe, T., 
et.al. (Mutat. Res. 429:189-199, 1999); Hour, T.C. et.al. (Food Chem. 
Toxicol. 37:569-579, 1999); Hrelia, P., et.al. (Chem. Biol. Interact. 1 18:99- 

30 111, 1999); Garganta, F., et.al. (Environ. Mol. Mutagen. 33:75-85, 1999); 
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Ukawa-Ishikawa S., et.al. (Mutat. Res. 412:99-107, 1998); 
www.ehs. utah.edu/ohli/mutagens , etc. To demonstrate the ability of CMs 
to increase the mutation frequency in MMR defective bacterial cells, we 
exposed T7-PMS134 BL21 cells to CMs. 
5 T7-PMS 1 34 cells and empty vector control cells were grown with 

IPTG for 48 hours and then diluted 1 : 50 in LB plus IPTG and increasing 
amounts of ethyl methane sulfonate (EMS) from 0, 1,10, 50, 100, and 200 
uM. 10 uL aliquots of culture (diluted in 300 u.1 LB) were plated out on 
LB agar plus ampicillin plates and grown overnight at 37C. The next day 

10 plates were analyzed for cell viablility as determined by colony formation. 
Analysis found that while no significant difference in colony number was 
observed between the pT7-PMS134 and control at the 0, ] , or 10 uM 
concentrations (all had > 1000 colonies), the number of control cells were 
reduced to 30 and 0 at the 50 and 100 uM concentrations, respectively. No 

1 5 difference was observed in the pT7-PMS 1 34 cells treated with 0, 1 , 1 0 or 
50 uM, while a 3 fold reduction was observed in cultures treated with 100 
pM EMS. The 200 uM treatment was toxic for both lines. These data 
demonstrate the ability of MMR deficiency to protect prokaryotes against 
the toxic effects of DNA akylating agents and provides a means to generate 

20 a wider range of mutations that can lead to an increased number of genetic 
variations and an increase in the number of new biochemical activities 
within host proteins to produce new output traits for commercial 
applications. 

To confirm that MMR deficient bacterial cells treated with CM can 

25 result in an increased mutation rate and produce a greater number of 

variants, we cultured pT7-PMS134 cells and empty vector controls in the 

presence of IPTG for 48 hours, followed by dilution and regrowth in 25uM 

EMS for 24 hours as described above. Cells were plated out on 100 mM 

petri dishes containing amplicillin or KAN and scored for KAN resistance. 

30 Analysis revealed that an 1 1-fold increase in the generation of KAN 
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resistant cells were found in pT7-Ea-PMS134V5 cells in contrast to control 
cells. 

These data demonstrate the use of employing a regulated dominant 
negative MMR system plus chemical mutagens to produce enhanced 
5 numbers of genetically altered prokaryotic strains that can be selected for 
new output traits. This methods is now useful generating such organims 
for commercial applications such as but not limited to recombinant 
manufacturing, biotransformation, and altered biochemicals 
(biotransformation) with enhanced activities for manufacturing purposes 
10 and gene discovery for pharmaceutical compound development. 

EXAMPLE 6: Alternative Methods to Inhibition of Bacterial 
MMR Activity 

The inhibition of MMR activity in a host organism can be achieved 
1 5 by introducing a dominant negative allele as shown in EXAMPLES 2 and 3. 
This application also teaches us the use of using regulated systems to 
control MMR in prokaryotes to generate genetic diversity and output traits 
for commercial applications. Other ways to regulate the suppression of 
MMR activity of a host is by using genetic recombination to knock out 
20 alleles of a MMR gene that can be spliced out such after selection using a 
system such as the CRE-Lox system; 2) blocking MMR protein 
dimerization with other subunits (which is required for activity) by the 
introduction of polypeptides or antibodies into the host via transfection 
methods routinely used by those skilled in the art; or 3) decreasing the 
25 expression of a MMR gene using anti-sense oligonucleotides. 

MMR gene knockouts. We intend to generate disrupted targeting vectors of 
a particular MMR gene and introduce it into the genome of bacteria using 
methods standard in the art. Bacteria exhibiting hypermutability will be 
30 useful to produce genetically diverse offspring for commercial applications. 
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Bacteria will be confirmed to have lost the expression of the MMR gene 
using standard northern and biochemical techniques (as described in 
reference 32). MMR gene loci can be knocked out, strains selected for new 
output traits and MMR restored by introducing a wildtype MMR gene to 
5 complement the KO locus. Other strategies include using KO vectors that 
can target a MMR gene locus, select for host output traits and then have the 
KO vector "spliced" from the genome after strain generation. This process 
could be performed using systems such as but not limited to CRE-Lox. 

10 Blocking peptides. MMR subunits (MutS and MutL proteins) interact to 
form active MMR complexes. Peptides are able to specifically inhibit the 
binding of two proteins by competitive inhibition. The use of peptides or 
antibodies to conserved domains of a particular MMR gene can be 
introduced into prokaryotic cells using lipid transfer methods that are 

15 standard in the art. Bacteria will be confirmed to have lost the expression 
of the MMR gene using standard northern and biochemical techniques (as 
described in reference 32). Bacteria exhibiting hypermutability will be 
useful to produce genetically diverse sibs for commercial applications. 

20 Discussion 

The results described above will lead to several conclusions. Yhe 
expression of dominant negative MMR proteins results in an increase in 
hypermutability in bacteria. This activity is due to the inhibition of MMR 
biochemical activity in these hosts. This method provides a claim for use of 

25 dominant negative MMR genes and their encoded products for the creation 
of hypermutable bacteria to produce new output traits for commercial 
applications. 
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FXAMPLES OF MMR GENES AND ENCODED POLYPEPTIDES 

Yeast MLH1 cDNA (accession number U07187) 

1 aaataggaat gtgatacctt ctattgcatg caaagatagt gtaggaggcg ctgctattgc 
61 caaagacttt tgagaccgct tgctqtttca ttatagttga ggagttctcg aagacgagaa 
121 attagcagtt ttcggtgttt agtaatcgcg ctagcatgct aggacaattt aactgcaaaa 
181 ttttgatacg atagtgatag taaatggaag gtaaaaataa catagaccta tcaataagca 
241 atgtctctca gaataaaagc acttgatgca tcagtggtta acaaaattgc tgcaggtgag 
301 atcataatat cccccgtaaa tgctctcaaa gaaatgatgg agaattccat cgatgcgaat 
361 gctacaatga ttgatattct agtcaaggaa ggaggaatta aggtacttca aataacagat 
421 aacggatctg gaattaataa agcagacctg ccaatcttat gtgagcgatt cacgacgtcc 
481 aaattacaaa aattcgaaga tttgagtcag attcaaacgt atggattccg aggagaagct 
541 ttagccagta tctcacatgt ggcaagagtc acagtaacga caaaagttaa agaagacaga 
601 tgtgcatgga gagtttcata tgcagaaggt aagatgttgg aaagccccaa acctgttgct 
661 ggaaaagacg gtaccacgat cctagttqaa gacctttttt tcaatattcc ttctagatta 
721 agggccttga ggtcccataa tgatgaatac tctaaaatat tagatgttgt cgggcgatac 
781 gccattcatt ccaaggacat tggcttttct tgtaaaaagt tcggagactc taattattct 
841 ttatcagtta aaccttcata tacagtccag gataggatta ggactgtgtt caataaatct 
901 gtggcttcga atttaattac ttttcatatc agcaaagtag aagatttaaa cctggaaagc 
961 gttgatggaa aggtgtgtaa tttgaatttc atatccaaaa agtccatttc attaattttt 
021 ttcattaata atagactagt gacatgtgat cttctaagaa gagctttgaa cagcgtttac 
081 tccaattatc tgccaaaggg cttcagacct tttatttatt tgggaattgt tatagatccg 
141 gcggctgttg atgttaacgt tcacccgaca aagagagagg ttcgtttcct gagccaagat 
201 gagatcatag agaaaatcgc caatcaattg cacgccgaat tatctgccat tgatacttca 
261 cgtactttca aggcttcttc aatttcaaca aacaagccag agtcattgat accatttaat 
321 gacaccatag aaagtgatag gaataggaag agtctccgac aagcccaagt ggtagagaat 
tcatatacga cagccaatag tcaactaagg aaagcgaaaa gacaagagaa taaactagtc 
agaatagatg cttcacaagc taaaattacg tcatttttat cctcaagtca acagttcaac 
tttgaaggat cgtctacaaa gcgacaactg agtgaaccca aggtaacaaa tgtaagccac 
tcccaagagq cagaaaagct gacactaaat gaaagcgaac aaccgcgtga tgccaataca 
atcaatgata atgacttgaa ggatcaacct aagaagaaac aaaagttggg ggattataaa 
gttccaagca ttgccgatga cgaaaagaat gcactcccga tttcaaaaga cgggtatatt 
agagtaccta aggagcgagt taatgttaat cttacgagta tcaagaaatt gcgtgaaaaa 
gtagatgatt cgatacatcg agaactaaca gacatttttg caaatttgaa ttacgttggg 
861 gttgtagatg aggaaagaag attagccgct attcagcatg acttaaagct ttttttaata 
921 gattacggat ctgtgtgcta tgagctattc tatcagattg gtttgacaga cttcgcaaac 
981 tttggtaaga taaacctaca gagtacaaat gtgtcagatg atatagtttt gtataatctc 
2041 ctatcagaat ttgacgagtt aaatgacgat gcttccaaag aaaaaataat tagtaaaata 
2101 tgggacatga gcagtatgct aaatgagtac tattccatag aattggtgaa tgatggtcta 
2161 gataatgact taaagtctgt gaagctaaaa tctctaccac tacttttaaa aggctacatt 
2221 ccatctctag tcaagttacc attttttata tatcgcctgg gtaaagaagt tgattgggag 
2281 gatgaacaag agtgtctaga tggtatttta agagagattg cattactcta tatacctgat 
2341 atggttccga aagtcgatac actcgatgca tcgttgtcag aagacgaaaa agcccagttt 
2401 ataaatagaa aggaacacat atcctcatta ctagaacacg ttctcttccc ttgtatcaaa 
2461 cgaaggttcc tggcccctag acacattctc aaggatgtcg tggaaatagc caaccttcca 
2521 gatctataca aagtttttga gaggtgttaa ctttaaaacg ttttggctgt aataccaaag 
2581 tttttgttta tttcctgagt gtgattgtgt ttcatttgaa agtgtatgcc ctttccttta 
2641 acgattcatc cgcgagattt caaaggatat gaaatatggt tgcagttagg aaagtatgtc 
2701 agaaatgtat attcggattg aaactcttct aatagttctg aagtcacttg gttccgtatt 
2761 gttttcgtcc tcttcctcaa gcaacgattc ttgtctaagc ttattcaacg gtaccaaaga 
2821 cccgagtcct tttatgagag aaaacatttc atcatttttc aactcaatta tcttaatatc 
2881 attttgtagt attttgaaaa cagqatggta aaacgaatca cctgaatcta gaagctgtac 
2941 cttgtcccat aaaagtttta atttactgag cctttcggtc aagtaaacta gtttatctag 
3001 ttttgaaccg aatattgtgg gcagatttgc agtaagttca gttagatcta ctaaaagttg 
3061 tttgacagca gccgattcca caaaaatttg gtaaaaggag atgaaagaga cctcgcgcgt 
3121 aatggtttgc atcaccatcg gatgtctgtt gaaaaactca ctttttgcat ggaagttatt 
3181 aacaataaga ctaatgatta ccttagaata atgtataa 

Yeast MLH1 protein (accession number U071 87) 

MSLRI KALDAS WN K I AAGE 1 1 1 S PVNALKEMMENS I DANATMI 

DILVKEGG1KVLQITDNGSGINKADLP1LCERFTTSKL0KFEDLSQ1QTYGFRGEA1A 
S 1 SHVARVTVTTKVKEDRCAWRVS YAEGKMLESPKPVAGKDGTT I LVEDLFFH I PSRL 
RALRSHNDEYSKILDWGRYAIHSKDIGFSCKKFGDSNYSLSVKPSYTVODRIRTVFN 
KSVASNLITFHISKVEDLNLESVDGKVCNLNFISKKSI SLI FF1 NNRLVTCDLLRRAi. 
NSVYSNYLPKGFRPFI YLG1 VI DPAAVDVNVHPTKREVRFLSQDE1 1 EKI ANQLHAEL 
SAI DTSRTFKASSI STNKPESLI PFNDT JESDRNRKSLRQAQWEHSYTTANSQLRKA 
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KR0ENKLVRJDAS0AKITSFLSSSQ0FNFEGSSTKRQLSEPKVTHVSHS9EAEKLTLH 
^ ESEQPKDAHT1NDNDLKDQPKKKQKLGDYKVPSIADDEKNALPISKDGYIP.VPKERVN 
VNLTS1KKLRE)WDDS1HRELTDIFANLNYVGWDEERRLAA1<?HDLKLFLIPYCSVC 
YELFYQIGLTDFANFGKINLOSTNVSDDI VLYNLLSEFDELHDDASKEKI ISKIWDMS 
10 SMLMEYYSIELVNDGLDNDLKSVKLKSLPLLLKGYIPSLVKLPFFI YRLGKEVDWEDE 
0ECLDGILRE1ALLY IPDMVPKVDTLDASLEEDEKAOFINP.KEHISSLLEHVLFPC1K 
RRFLAPRHI LKDWE I AHLPDLYKV FERC 

15 

Mouse PMS2 protein 

MEQTEGVSTE CAKAIKPIDG KSVHOICSGO VILSLSTAVK ELIENSVDAG ATTIDLRLKD 60 
YGVDLIEVSD NGCGVEEENF EGLALKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 
TISTCHGSAS VGTP.LVFDHN GKITQKTPYP RPKGTTVSVQ HLFYTLPVRY KEFQRNIKKE 180 

20 YSKMVQVLQA YCIISAGVRV SCTUQLGQGK RHAWCTSGT SGMKENIGSV FGQKQLQSLI 240 
PFVQLPPSDA VCEEYGLSTS GRHKTFSTFR ASFHSARTAP GGVQQTGSFS SSJRGPVTQQ 300 
RSLSLSMRFY HMYNRHQYPF WLNVSVDSE CVDINVTPDK RQILLCEEKL IOAVLKTSLI 360 
GMFDSDANKL NVNQOPLLDV EGNLVKLHTA ELEKPVPGKQ T)NSPSLKSTA DEKRVASISR 420 
LREAFSLHPT KEIKSRGPET AELTRSFPSE KRGVLSSYPS DVISYRGLRG SQDKLVSPTD 480 

25 SPGDCMDREK 1EKDSGLSST SAGSEEEFST PEVAESFSSD YNVSSLEDF.P SQET1NCGDL 540 
DCRPPGTGQS LKPEDHGYQC KALPLARLSP TNAKRFKTEE RPSNVNISOR LPGPQSTSAA 600 
EVDVA1KMNK RIVLLEFSLS SLAKRMKQLO HLKAQNKHEL SYRKFRAKIC PGENQAAEPE 660 
tRKEISKSMF AEMEILGQFN LGF1 VTKLKE DLFLVDQHAA DEKYHFEMLQ QHTVLQAQRL 720 
ITPQTLNLTA VHEAVLIENL EIFRKNGFDF VI DEDAPVTE RAKLISLPTS KNWTFGPQDI 780 

30 DELIFMLSDS PGVMCRPSRV RQMFASRACR KSVMIGTALN ASEMKKLITH MGEMDHPWHC 840 
PHGRPTMRHV ANLDVISQN 
859 

Mouse PMS2 cDNA 

35 gaatt ccggt gaaggtcctg aagaatttcc agattcctga gtatcattgg aggagacaga 60 
taacctgtcg tcaggtaacg atggtgtata tgcaacagaa atgggtgttc ctggagacgc 120 
gtcttttccc gagagcggca ccgcaactct cccgcggtga ctgtgactgg aggagtcctg 180 
catccatgga gcaaaccgaa ggcgtgagta cagaatgtgc taaggccatc aagcctattg 240 
atgggaagtc agtccatcaa atttgttctg ggcaggtgat actcagttta agcaccgctg 300 

40 tgaaggagtt gatagaaaat agtgtagatg ctggtgctac tactattgat ctaaggctta 360 
aagactatgg ggtggacctc attgaagttt cagacaatgg atgtggggta gaagaagaaa 420 
actttgaagg tctagctctg aaacatcaca catctaagat tcaagagttt gccgacctca 480 
cgcaggttga aactttcggc tttcgggggg aagctctgag ctctctgtgt gcactaagtg 540 
atgtcactat atctacctgc cacgggtctg caagcgttgg gactcgactg gtgtttgacc 600 

45 ataatgggaa aatcacccag aaaactccct acccccgacc taaaggaacc acaqtcagtg 660 
tgcagcactt attttataca ctacccgtgc gttacaaaga gtttcagagg aacattaaaa 720 
aggagtattc caaaatggtg caggtcttac aggcgtactg tatcatctca gcaggcgtcc 780 
gtgtaagctg cactaatcag ctcggacagg ggaagcggca cgctgtggtg tgcacaagcg 840 
gcacgtctgg catgaaggaa aatatcgggt ctgtgtttgg ccagaagcag ttgcaaagcc 900 

50 tcattccttt tgttcagctg ccccctagtg acgctgtgtg tgaagagtac ggcctgagca 960 
cttcaggacg ccacaaaacc ttttctacgt ttcgggcttc atttcacagt gcacgcacgg 1020 
cgccgggagg agtgcaacag acaggcagtt tttcttcatc aatcagaggc cctgtgaccc 1080 
agcaaaggtc tctaagcttg tcaatgaggt tttatcacat gtataaccgg catcagtacc 1140 
catttgtcgt ccttaacgtt tccgttgact cagaatgtgt ggatattaat gtaactccag 1200 

55 ataaaaggca aattctacta caagaagaga agctattgct ggccgtttta aagacctcct 1260 
tgataggaat gtttgacagt gatgcaaaca agcttaatgt caaccagcag ccactgctag 1320 
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atgttgaagg taacttagta aagctgcata ctgcagaact agaaaagcct gtgccaggaa 1380 

agcaagataa ctctccttca ctgaagagca cagcagacga gaaaagggta gcatccatct 1440 

ccaggctgag agagqccttt tctcttcatc ctactaaaga gatcaagtct aggggtccag 1500 

agactgctga actgacacgg agttttccaa gtgagaaaag gggcgtgtta tcctcttatc 1560 

ctccagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 1620 

cggacagccc tggtgactgt atggacagag agaaaataga aaaagactca gggctcagca 1680 

gcacctcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttagca 1740 

gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccata aactgtggtg 1800 

acctggactg ccgtccccca ggtacaggac agtccttgaa gccagaagac catggatatc 1860 

aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc ttcaagacag 1920 

aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag agcacctcag 1980 

cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc gagttctctc 2040 

tgagttctct agctaagcga atgaagcagt tacagcacct aaaggcgcag aacaaacatg 2100 

aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaag 2160 

atgaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgggtcagt 2220 

ttaacctggg atttatagta accaaactga aagaggacct cftcctggtg gaccagcatg 2280 

ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcaga 2340 

ggctcatcac accccagact ctgaacttaa ctgctgtcaa tgaagctgta ctgatagaaa 2400 

atctggaaat attcagaaag aatggctttg actttgtcat tgatgaggat gctccagtca 2460 

ctgaaagggc taaattgatt tccttaccaa ctagtaaaaa ctggaccttt ggaccccaag 2520 

atatagatga actgatcttt atgttaagtg acagccctgg ggtcatgtgc cggccctcac 2580 

gagtcagaca gatgtttgct tccagagcct gtcggaagtc agtgatgatt ggaacggcgc 2640 

tcaatgcgag cgagatgaag aagctcatca cccacatggg tgagatggac cacccctgga 2700 

actgccccca cggcaggcca accatgaggc acgttgccaa tctggatgtc atctctcaga 2760 

actgacacac cccttgtagc atagagttta ttacagattg ttcggtttgc aaagagaagg 2820 

ttttaagtaa tctgattatc gttgtacaaa aattagcatg ctgctttaat gtactggatc 2880 

catttaaaag cagtgttaag gcaggcatga tggagtgttc ctctagctca gctacttggg 2940 

tgatccggtg ggagctcatg tgagcccagg actttgagac cactccgagc cacattcatg 3000 

agactcaatt caaggacaaa aaaaaaaaga tatttttgaa gccttttaaa aaaaaa 3056 

human PMS2 protein 

MKQLPAATVR LLSSSQIITS WSVVKELIE NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60 
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGS1 CCIAEVLITT RTAADNFSTQ 120 
YVLDGSGHIL SQKPSHLGOG TTVTALRLFK NLPVRKOFYS TAKKCKDEIK K10DLLMSFG 180 
ILKPDLRIVF VHNKAVIWOK SRVSDHKMAL MSVLGTAVMN NMESFQYHSE ESQIYLSGFL 240 
PKCDADHSFT SLSTPERSFI FINSRPVHOK DILKLIRHHY NLKCLKESTR LYPVFFLKID 300 
VPTADVDVNL TPDKSQVLLQ NKESVLIALE NLMTTCYGPL PSTNSYENNK TDVSAADIVL 360 
SKTAETDVLF NKVESSGKNY SNVDTSVIPF QNDMHNDESG KNTDDCLNHQ IS1GDFGYGH 4 20 
CSSEISNIDK NTKNAFODIS MSNVSWENSQ TEYSKTCFIS SVKHTQSENG NKDHIDESGE 480 
HEEEAGLENS SEISADEWSR GNILKNSVGE NIEPVKILVP EKSLPCKVSN NHYPIPEQMN 540 
LMEDSCNKKS NV1DNKSGKV TAYDLLSNRV IKKPMSASAL FVQDHRPQFL IENPKTSLED 600 
ATLQIEELWK TLSEEEKLKY EEKATKDLER YNSQMKRAIE QESQMSLKDG RKKIKPTSAW 660 
NLAOKHKLKT SLSNQPKLDE LLQSQIEKRR SQNIKMVQIP FSMKNLK1NF KKOHKVDLEE 720 
KDEPCLIHNL RFPDAWLMTS KTEVMLLNPY RVEEALLFKR LLENHKLPAE PLEKPIMLTE 780 
SLFNGSHYLD VLYKMTADDQ RYSGSTYLSD PRLTANGFKI KL1PGVSITE NYLEIEGMAN 840 
CLPFYGVADL KEILNAILNR NAKEVYECRP RKVISYLEGE AVRLSRQLPM YLSKEDIQDI 900 
IYRMKHQFGN EIKECVHGRP FFHHLTYLPE TT 932 
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Human PMS2 cDNA 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 

aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 

ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 

aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 

tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 

caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 

tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 

actcgactga tgtttgatca caatcggaaa attatccaga aaacccccta cccccgcccc 4 80 

agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540 

tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600 

atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660 

cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720 

cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 780 

gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 840 

atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 900 

aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 960 

tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020 

gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1080 

gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140 

agtcagcagc cactgctgga tgttcaaggt aacttaataa aaatgcatgc agcggatttg 1200 

gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1260 

aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320 

aagcctcaca gcccaaagac tccaoaacca agaaggagcc ctctaggaca gaaaaggggt 1380 

atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 14 40 

gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1500 

gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1560 

agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1620 

gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 16e0 

tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1740 

accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800 

aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1860 

aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1920 

catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980 

tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040 

tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100 

gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2160 

cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2220 

gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2280 

tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2340 

agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2400 

agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 2460 

cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2520 

cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2580 

atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2640 
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tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2700 
atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgaqaac 2760 



human PMS1 protein 

MKQLPAATVR LLSSSQIITS WSWKELIE NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60 
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGSI CCIAEVLITT RTAADNFSTQ 12(7 
YVLDGSGHIL SOKPSHLGQG TTVTALRLFK NLPVRKQFYS TAKKCKDE1K KIQDLLMSFG 180 
ILKPDLR1VF VHNKAVIWQK SRVSDHKMAL MSVLGTAVMN NMESFQYHSE ESQIYLSGFL 240 
PKCDADHSFT SLSTPERSFJ FINSRPVHQK DILKLIRHHY NLKCLKESTR LY PVFFXK1D 300 
VPTADVDVNL TPDKSQVLLQ NKESVLIALE NLMTTCYGPL PSTNSYENNK TDVSAADIVL 360 
SKTAETDVLF NKVESSGKNY SNVDTSVIPF QNDMHNDESG KNTDDCLNHO ISIGDFGYGH 420 
CSSEISNIDK NTKNAFQDIS MSNVSWEHSQ TEYSKTCF1S §VKHTQSENG NKDHIDESGE 480 
NEEEAGLENS SEISADEWSR GNILKNSVGE NIEPVKILVP EKSLPCKVSN NNYP1PEQMN 540 
LNEDSCNKKS NVI DNKSGKV TAYDLLSNRV 1KKPMSASAL FVODHRPQF1. IENPKTSLED 600 
ATLQIEELWK TLSEEEKLKY EEKATKDLER YNSQMKPJUE QESQMSLKDG RKK1KPTSAW 660 
NLAQKHKLKT SLSNQPKLDE LLQSOIEKRR SQNIKMVQIP FSMKNLKINF KKQNKVDLEE 720 
KDEPCLIHNL RFPDAWLMTS KTEVMLLNPY RVEEALLFKR LLENHKLPAE PLEKPIMLTE 780 
SLFNGSHYLD VLYKMTADDQ RYSGSTYLSD PRLTANGFKI KLIPGVSITE NYLEIEGMAN 840 
CLPFYGVADL KEIINAJUW NAKEVYECRP RKVISYLEGE AVRLSROLPM YLSKED1QDI 900 
I YRMKHQFGN E1KECVHGRP FFHHLTYLPE TT 
932 

Human PMS1 cDNA 

ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 60 
ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 120 
gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 180 
atgctggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 240 
tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 300 
acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 360 
gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 420 
ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 480 
cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 540 
taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 600 
atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 660 
aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 720 
tggggactgc tgttatgaac aatatggaat cctttcagta ccactctgaa g'aatctcaga 780 
tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctttcaa 840 
caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 900 
agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 960 
ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1020 
aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1080 
cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1140 
ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1200 
aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1260 
tgcataatga Cgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1320 
gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1380 
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atgcatttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1440 
gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 1500 
atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1560 
ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1620 
ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1680 
caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag 1740 
ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1800 
ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc I860 
ctaagaetag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1920_ 
aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1980 
aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 2040 
: cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2100 
Ltgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2160 
ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 2220 
acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 2280 
atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 2340 
aagccctgct atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 2400 
agccaattat gttaacagag agtcttttta atggatctca ttatttagac gttttatata 2460 
aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 2520 
cagcgaatgg tttcaagata aaattgatac caggagtttc aattactgaa aattacttgg 2580 
aaatagaagg aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 2640 
ttaatgctat attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 270O 
taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 2760 
aagaggacat ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 2820 
agtgtgttca tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 2880 
taaatatgtt taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 2940 
tctggtttta aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 3000 
ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3060 



human MSH2 protein 

MAVQPKETLO LESAAEVGFV RFFQGMPEKP TTTVRLFDRG DFYTAHGEDA LLAAREVFKT 60 
QGVJKYMGPA GAKNLQSWL SKMNFESFVK DLLLVRQYRV EVYKNRAGNK ASKENDWYLA 120 
YKASPGNLSQ FED1LFGNND MSASIGVVGV KMSAVDGQRQ VGVGYVDSIQ RKLGLCEFPD 180 
NDQFSNLEAL LIQIGPKECV LPGGETAGDM GKLRQI1QRG GILITERKKA DFSTKDIYQD 240 
LNRLLKGKKG EOMNSAVLPE MENQVAVSSL SAVIKFLELL SDDSNFGQFE LTTFDFSQYM 300 
KLD1AAVRAL NLFQGSVEDT TGSQSLAALL NKCKTPQGQR LVNQW1KQPL MDKNRIEERL 360 
NLVEAFVEDA ELRQTLQEDL LRRFPDLNRL AKKFQRQAAH LQDCYRLYQG INQLPNVIQA 420 
LEKHEGKHQK LI* LAV FVT PL TDLRSDFSKF QEMIETTLDM DQVENHEFLV KPSFDPNLSE 480 
LREIMNDLEK KMQSTLISAA RDLGLDPGKQ IKLDSSAQFG YYFRVTCKEE KVLRNNKNFS 540 
TVDIQKNGVK FTNSKLTSLN EEYTKNKTEY EEAQDAI VKE IVNISSGYVE PMQTLNDVLA 600 
QLDAWSFAH VSNGAPVPYV RPAILEKGQG RIILKASRHA CVEVQDEIAF IPNDVYFEKD 660 
KOMFHI ITGP NMGGKSTYIR QTGVI VLMAQ IGCFVPCESA EVSIVDCILA RVGAGDSOLK 720 
GVSTFMAEML ETASILRSAT KDSLIIIDEL GRGTSTYDGF GLAWAI SEY I ATKIGAFCMF 780 
ATHFHELTAL ANQI PTVNNL HVTALTTEET LTMLYQVKKG VCDOSFGIHV AELANFPKHV 840 
IECAKQKALE LEEFQYIGES OGYDIMEPAA KKCYLEREOG EKI IQEFLSK VKCMPFTEMS 900 
EEHITIKLKO LKAEVIAKHN SFVNEI1SR1 KVTT 934 



-42- 



BNSDOCID: <WO 01S9092A2J_> 



WO 01/59092 



PCTYUS01/04339 



Human MSH2 cDNA 

ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttcaaccagg aggtgaggag 60 

gtttcgacat ggcggtgcag ccgaaggaga cgctgcagtt ggagagcgcg gccgaggtcg 120 

gcttcgtgcg cttctttcag ggcatgccgg agaagccgac caccacagtg cgccttttcg 180 

accggggcga cttctatacg gcgcacggcg aggacgcgct gctggccgcc cgggaggtgt 240 

tcaagaccca gggggtgatc aagtacatgg ggccggcagg ageaaagaat ctgcagagtg 300 

ttgtgcttag taaaatgaat tttgaatctt ttgtaaaaga tcttcttctg gttcgtcagt 360 

atagagttga agtttataag aatagagctg gaaataaggc atccaaggag aatgattggt 420 

atttggcata taaggcttct cctggcaatc tctctcagtt tgaagacatt ctctttggta 480 

acaatgatat gtcagcttcc attggtgttg tgggtgttaa aatgtccgca gttgatggcc 540 

agagacaggt tggagttggg tatgtggatt ccatacagag gaaactagga ctgtgtgaat 600 

tccctgataa tgatcagttc tccaatcttg aggctctcct catccagatt ggaccaaagg 660 

aatgtgtttt acccggagga gagactgctg gagacatggg gaaactgaga cagataattc 720 

aaagaggagg aattctgatc acagaaagaa aaaaagctga cttttccaca aaagacattt 780 

atcaggacct caaccggttg ttgaaaggca aaaagggaga gcagatgaat agtgctgtat 840 

tgccagaaat ggagaatcag gttgcagttt catcactgtc tgcggtaatc aagtttttag 900 

aactcttatc agatgattcc aactttggac agtttgaact gactactttt gacttcagcc 960 

agtatatgaa attggatatt gcagcagtca gagcccttaa cctttttcag ggttctgttg 1020 

aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa acccctcaag 1080 

gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140 

agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 1200 

aagatttact tcgtcgattc ccagatctta accgacttgc caagaagttt caaagacaag 1260 

cagcaaactt acaagattgt taccgactct atcagggtat aaatcaacta cctaatgtta 1320 

tacaggctct ggaaaaacat gaaggaaaac accagaaatt attgttggca gtttttgtga 1380 

ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 1440 

tagatatgga tcaggtggaa aaccatgaat tccttgtaaa accttcattt gatcctaatc 150O 

tcagtgaatt aagagaaata atgaacgact tggaaaagaa gatgcagtca acattaataa 1560 

gtgcagccag agatcttggc ttggaccctg gcaaacagat taaactggat tccagtgcac 1620 

agtttggata ttactttcgt gtaacctgta aggaagaaaa agtccttcgt aacaataaaa 1680 

actttagtac tgtagatatc cagaagaatg gtgttaaatt taccaacagc aaattgactt 1740 

ctttaaatga agagtatacc aaaaataaaa cagaatatga agaagcccag gatgccattg 1800 

ttaaagaaat tgtcaatatt tcttcaggct atgtagaacc aatgcagaca ctcaatgatg 1860 

tgttagctca gctagatgct gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920 

catatgtacg accagccatt ttggagaaag gacaaggaag aattatatta aaagcatcca 1980 

ggcatgcttg tgttgaagtt caagatgaaa ttgcatttat tcctaatgac gtatactttg 2040 

aaaaagataa acagatgttc cacatcatta ctggccccaa tatgggaggt aaatcaacat 210O 

atattcgaca aactggggtg atagtactca tggcccaaat tgggtgtttt gtgccatgtg 2360 

agtcagcaga agtgtccatt gtggactgca tcttagcccg agtaggggct ggtgacagtc 2220 

aattgaaagg agtctccacg ttcatggctg aaatgttgga aactgcttct atcctcaggt 2280 

ctgcaaccaa agattcatta ataatcatag atgaattggg aagaggaact tctacctacg 2340 

atggatttgg gttagcatgg gctatatcag aatacattgc aacaaagatt ggtgcttttt 2400 

gcatgtttgc aacccatttt catgaactta ctgccttggc caatcagata ccaactgtta 2460 

ataatctaca tgtcacagca ctcaccactg aagagacctt aactatgctt tatcaggtga 2520 

agaaaggtgt ctgtgatcaa agttttggga ttcatgttgc agagcttgct aatttcccta 2580 

agcatgtaat agagtgtgct aaacagaaag ccctggaact tgaggagttt cagtatattg 2640 

gagaatcgca aggatatgat atcatggaac cagcagcaaa gaagtgctat ctggaaagag 2700 

agcaaggtga aaaaattatt caggagttcc tgtccaaggt gaaacaaatg ccctttactg 2760 

aaatgtcaga agaaaacatc acaataaagt taaaacagct aaaagctgaa gtaatagcaa 2820 

agaataatag ctttgtaaat gaaatcattt cacgaataaa agttactacg tgaaaaatcc 2880 
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cagcaatgga acgaaggtaa tattgataag ctattgtctg taatagtttt atattgtttt 2940 
atattaaccc tttttccata gtgttaactg tcagtgccca tgggctatca acttaataag 3000 
atatttagta atattttact ttgaggacat tttcaaagat ttttattttg aaaaatgaga 3060 
gctgtaactg aggactgttt gcaattgaca taggcaataa taagtqatgt gctgaatttt 3120 
ataaataaaa tcatgtagtt tgtgg 
3145 

human MLH1 protein 

MSFVAGVIRR LDETWNRIA AGEV1QRPAN AIKEM1ENCL DAKSTSIQVI VKEGGLKLIQ 60 - 
IQDNGTGIRK EDLDIVCERF TTSKLQSFED LAS I ST YGFR GEALAS1SHV AHVTITTKTA 120 
DGKCAYRASY SDGKLKAPPK PCAGNQGTQ1 TVEDLFYNIA TRRKALKNPS EEYGKJ LEW 180 
GRYSVHNAGI SFSVKKQGET VADVRTLPNA STVDNIRSIF GNAVSREL1E IGCEDKTLAF 240 
KMNGYISNAN YSVKKC1 FLL FINHPLVEST SLRKAIETVY AAYLPKNTHP FLYLSLEISP 300 
QNVDVNVHPT KHEVHFLHEE SILERVQQHI ESKLLGSNSS RMYFTQTLLP GLAGPSGEMV 360 
KSTTSLTSSS TSGSSDKVYA HOMVRTDSRE QKLDAFLQPL SKPLSSQPQA IVTEDKTDIS 420 
SGRAFQQDEE MLELPAPAEV AAKNQSLEGD TTKGTSEMSE KRGPTSSNPR KRHREDSDVE 480 
MVEDDSRKEM TAACT PRRRI INLTSVLSLQ EEINEQGHEV LREMLHNHSF VGCVNPQWAL 540 
AOHQTKLYLL NTTKLSEELF YQILIYDFAN FGVLRLSEPA PLFDLAMLAL DSPESGWTEE 600 
DGPKEGLAEY I VEFLKKKAE MLADYFSLEI DEEGNLIGLP LLI DNYVPPL EGLPIFILRL 660 
ATEVMWDEEK ECFESLSKEC AMFYSIRKQY ISEESTLSGQ OSEVPGSIPN SWKWTVEHIV 720 
YKALRSHILP PKHFTEDGN1 LQLAHLPDLY KVFERC 755 

Human MLH1 cDNA 

cttggctctt ctggcgccaa aatgtcgttc gtggcagggg ttattcggcg gctggacgag 60 
acagtggtga accgcatcgc ggcgggggaa gttatccagc ggccagctaa tgctatcaaa 120 
gagatgattg agaactgttt agatgcaaaa tccacaagta ttcaagtgat tgttaaagag 180 
ggaggcctga agttgattca gatccaagac aatggcaccg ggatcaggaa agaagatctg 24 0 
gatattgtat gtgaaaggtt cactactagt aaactgcagt cctttgagga tttagccagt 300 
atttctacct atggctttcg aggtgaggct ttggccagca taagccatgt ggctcatgtt 360 
actattacaa cgaaaacagc tgatggaaag tgtgcataca gagcaagtta ctcagatgga 420 
aaactgaaag cccctcctaa accatgtgct ggcaatcaag ggacccagat cacggtggag 460 
gacctttttt acaacatagc cacgaggaga aaagctttaa aaaatccaag tgaagaatat 540 
gggaaaattt tggaagttgt tggcaggtat tcagtacaca atgcaggcat tagtttctca 600 
gttaaaaaac aaggagagac agtagctgat gttaggacac tacccaatgc ctcaaccgtg 660 
gacaatattc gctccatctt tggaaatgct gttagtcgag aactgataga aattggatgt 720 
gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg 780 
aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840 
aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900 
ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960 
gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020 
ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080 
ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 1140 
agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200 
gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260 
gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320 
ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380 
gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 1440 
cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct 1500 
igaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560 
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aatgagcagg 


gacatgaggt 


tctccgggag 


atgttgcata 




cgtgggctgt 


1620 




agtgggcctt 


ggcacagcat 


caaaccaagt 


tataccttct 


caacaccacc 




aagcttagtg 


aagaactgt t 


ctaccagata 


ctcatttatg 


att ttgccaa 


ttttggtgtt 


1740 


ctcaggttat 


cggagccagc 


accgctcttt 


gacct tgcca 


tgcttgcctt 


agatagtcco 


1800 


gagagtggct 


ggacagagga 


agatggtccc 


aaagaaggac 


ttgctgaata 


cattgttgag 


1860 


tttctgaaga 


agaaggctga 


gatgcc tgca 


gactatttct 


ctttggaaat 


tgacgaggaa 


1920 


gggaacctga 


ttggattacc 


ccttctgatt 


gacaactatg 


tgcccccttt 


ggagggactg 


1980 


cctatcttca 


ttcttcgact 


agccactgag 


gtgaattggg 


acgaagaaaa 


ggaatgtttt 


2040 


gaaagcctca 


gtaaagaatg 


cgctatgttc 




ggaagcagta 


catatctgag 


2100 


gagtcgaccc 


tctcaggcca 


gcagagtgaa 


gtgcctggct 


ccattccaaa 


ctcctggaag 




tggactgtgg 


aacacattgt 


ctataaagcc 


ttgcgctcac 






2220 


ttcacagaag 


atggaaatat 


cctgcagctt 


gctaacctgc 






2280 


gagaggtgtt 


aaatatggtt 


atttatgcac 


tgtgggatgt 




ctctgtattc 


2340 


cgatacaaag 


tgttgtatca 


aagtgtgata 


tacaaagtgt 


accaacataa 


gtgttggtag 


2400 


cacttaagac 


ttatacttgc 


cttctgatag 


tattccttta 


tacacagtgg 


attgattata 


2460 






cata 








2484 



hPMS2-134 protein 

MKQLPAATVR LLSSSQI ITS WSWKELIE 
IKAVDAPVMA MKYYTSKINS HEDLENLTTY 
YVLDGSGHIL SQK 



NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60 
GFRGEALGSI CCIAEVLITT RTAADNFSTQ 120 
133 



hPMS2-134 cDNA 
cgaggcggat cgggtgttgc 
aaggccatca aacctattga 
ctgagtctaa gcactgcggt 
aatattgatc taaagcttaa 
tgtggggtag aagaagaaaa 
caagagtttg ccgacctaac 
tcactttgtg cactgagcga 



atccatggag cgagctgaga 
tcggaagtca gtccatcaga 
aaaggagtta gtagaaaaca 
ggactatgga gtggatctta 
cttcgaaggc ttaactctga 
tcaggttgaa acttttggct 
tgtcaccatt tctacctgcc 



gctcgagtac agaacctgct 60 
tttgctctgg gcaggtggta 120 
gtctggatgc tggtgccact 180 
ttgaagtttc agacaatgga 240 
aacatcacac atctaagatt 300 
ttcgggggga agctctgagc 360 
acgcatcggc gaaggttgga 420 
426 



hMSH6 (human cDNA) ACCESSION U2894 6 
MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSP 

GGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEG 
YPVTOPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQ 
KGGHFYSAKPEILRAMQRADEALNKDKIKRIXLAVCDEPSEPEEEEEMEVGTTYVTDK 
SEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSD 
EISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATS1SSETK 
NTLRAFSAPQNSESOAUVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFD 
ASTLYVPEDFLNSCTPGMRKWW0IKSONFDLVICYKVGKFYELYHMDALIGVSELGLV 
FMKGNWAHSGFPEIAFGRYSDSLVOKGYKVARVEQTETPEMMEARCRKMAHISKYDRV 
VRREICRI ITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLG 
KFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPG 
SOFHDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELAI-S 
ALGGCVFYLKKCLI DQELLSMANFEEYI PLDSDTVSTTRSGAI FTKAYQRMVLDAVTL 
NNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDL 
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MWPDKISEWELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKII 
^ DFLSALEGFKVMCKI IGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDT 
AFDHEKARKTGLlTPKAGFDSDYDQALADIRENEOSLLEYLEKQRHRIGCRTlVYWn 
GRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLAHLIMAEERRDVSLK 

10 DCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLFEDTPPF 
LELKGSRHPCITKTFFGDDFI PNDIL1GCEEEEQENGKAYCVLVTGPNMGGKSTLMRQ 

j ^ AGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHA 
TAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSONVAV 
RLGHMACMVENECEDPSQETITFLYKFIKGAePKSYGFNAARLANLPEEVIQKGHRKA 

20 REFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTL I KEL " 



hPMSR2 (human CDNA) ACCESSION U38964 



1 ggcgctccta c< 
61^ttggggagcc ac 

181 caccaacggt 

241 ggaaaccgcc 

301 aggccacgac 

361 gcagtccggt 

421 gagccactag 

481 gagcacaacg 

541 ggagggaact 

601 aactcgagta 

721 aacagtctgg 

781 ctcattgaag 

841 ctgaaacatc 

901 ggctttcggg 

961 tgccgtgtat 

1021 cagaaaaccc 

1081 acgctacctg 

1201 cagcctgtag 
1261 gt. 



1321 
1381 , 



tggtt( 



:tgcaagtg gctagtgcca 
itacatgca ggtgggctcc a. 
ggtacatcgg catggcgcaa 
cggaaaccgc cagacaccaa 
agacaccaac gctcggaaac 
ggagggcgac tacctccctt 
gtgctctgat tggtccaggc 
gcgaaaagga gagacgggaa 
tcgaaagcag ccgttgggag 
ttcccagtcc ccgaggcgga 
cagaacctgc taaggccatc 
ggccggtggt accgagtcta 
atgctggtgc cactaatgtt 
tttcaggcaa tggatgtggg 
acacatgtaa gattcaagag 
gggaagctct gagctcactt 
cagcgaaggt tgggactcga 
cctacccccg ccccagaggg 
tgcaccataa agaatttcaa 
tctgccgtga ttgtcagttt 
aactgactcc tagaagtacc 
tattcagctc tgtcaagaat 
tcctccttcc tgatgtctgc 



igtgctgggc cgccgctcct gccgtgcatg 
cacggagag gggcgcagac ccggtgacag 
ccaaagcaag agagggtggc gcgtgccaga 
cggtcggaaa ccgccaagac accaacgctc 
cgceagacac caaqgctcgg aatccacgcc 
ctgaccctgc tgctggcgtt cggaaaaaac 
tctttgacgt cacggactcg acctttgaca 
gtattttttc cgccccgccc ggaaagggtg 
cccaggaggc ggggcgcctg tgggagccgt 
tccggtgttg catccttgga gcgagctgag 
aaacctattg atcggaagtc agtccatcag 
aggccgaatg cggtgaagga gttagtagaa 
gatctaaagc ttaaggacta tggagtggat 
gtagaagaag aaaacttcga aggctttact 
tttgccgacc taactcaggt ggaaactttt 
tgtgcactga gtgatgtcac catttctacc 
ctggtgtttg atcactatgg gaaaatcatc 
atgacagtca gcgtgaagca gttattttct 
aggaatatta agaagaaacg tgcctgcttc 
cctgaggcct ccccagccat gcttcctgta 
ccaccccacc cctgctcctt ggaggacaac 
ggtccaggtt cttctagatg atctgcacaa 
cattagcatt ggaataaagt tcctgctgaa 



hPMSR2 (human protein) ACCESSION U38964 
MAQPKQERVARARHQRSETARHORSETAKTPTLGNRQTPTLGNR 

QTPRLGIHARPRRRATTSLLTLLLAFGKNAVRCALIGPGSLTSRTRPLTEFLGEKERR 
EVFFPPRPERVEHNVESSRWEPRRRGACGSRGGNFPSPRGGSGVASLERAENSSTEPA 
KAIKPIDRKSVHQICSGPVVPSLRPNAVKELVENSLDAGATNVDLKLKDYGVDLIEVS 
GNGCGVEEENFEGFTLKHHTCKIQEFADLTQVETFGFRGEALSSLCALSDVTISTCRV 
SAKVGTRLVFDHYGKIIQKTPYPRPRGMTVSVKQLFSTLPVHHKEFQRNIKKKRACFP 
FAFCRDCQFPEASPAMLPVQPVELTPRSTPPHPCSLEDNVJTVFSSVKNGPGSSR 



HPMSR3 (human CDNA) ACCESSION U38979 

1 tttttagaaa ctgatgttta ttttccatca accatttttc catgctgctt aagagaatat 

61 gcaagaacag cttaagacca gtcagtggtt gctcctaccc attcagtggc ctgagcagtg 

121 gggagctgca gaccagtctt ccgtggcagg ctgagcgctc cagtcttcag tagggaattg 

181 ctgaataggc acagagggca cctgtacacc ttcagaccag tctgcaacct caggctgagt 

241 agcagtgaac tcaggagcgg gagcagtcca ttcaccctga aattcctcct tggtcactgc 

301 cttctcagca gcagcctgct cttctttttc aatctcttca ggatctctgt agaagtacag 

361 atcaggcatg acctcccatg ggtgttcacg ggaaatggtg ccacgcatgc gcagaacttc 

421 ccgagccagc atccaccaca ttaaacccac tgagtgagct cccttgttgt tgcatgggat 

481 ggcaatgtcc acatagcgca gaggagaatc tgtgttacac agcgcaatgg taggtaggtt 
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541 aacataagat gcctccgtga gaggcgaagg ggcggcggga cccgggcctg gcccgtatgt 

601 gtccttggcg gcctagacta ggccgtcgct gtatggtgag ccccagggag gcggatctgg 

661 gcccccagaa ggacacccgc ctggatttgc cccgtagccc ggcccgggcc cctcgggagc 

721 agaacagcct tggtgaggtg gacaggaggg gacctcgcga gcagacgcgc gcgccagcga 

781 cagcagcccc gccccggcct ctcgggagcc ggggggcaga ggctgcggag ccccaggagg 

841 gtctatcagc cacagtctct gcatgtttcc aagagcaaca ggaaatgaac acattgcagg 

901 ggccagtgtc attcaaagat gtggctgtgg atttcaccca ggaggagtgg cggcaactgg 

961 accctgatga gaagatagca tacggggatg tgatgttgga gaactacagc catctagttt 

1021 ctgtggggta tgattatcac caagccaaac atcatcatgg agtggaggtg aaggaagtgg 

1081 agcagggaga ggagccgtgg ataatggaag gtgaatttcc atgtcaacat agtccagaac 

1141 ctgctaaggc catcaaacct attgatcgga agtcagtcca tcagatttgc tctgggccag 

1201 tggtactgag tctaagcact gcagtgaagg agttagtaga aaacagtctg gatgctggtg 

1261 ccactaatat tgatctaaag cttaaggact atggagtgga tctcattgaa gtttcagaca 

1321 atggatgtgg ggtagaagaa gaaaactttg aaggcttaat ctctttcagc tctgaaacat 

1381 cacacatgta agattcaaga gtttgccgac ctaactgaag ttgaaacttt cggttttcag 

1441 ggggaagctc tgagctcact gtgtgcactg agcgatgtca ccatttctac ctgccacgcg 

1501 ttggtgaagg ttgggactcg actggtgttt gatcacgatg ggaaaatcat ccaggaaacc 

1561 ccctaccccc accccagagg gaccacagtc agcgtgaagc agttattttc tacgctacct 

1621 gtgcgccata aggaatttca aaggaatatt aagaagacgt gcctgcttcc ccttcgcctt 

1681 ctgccgtgat tgtcagtttc ctgaggcctc cccagccatg cttcctgtac agcctgcaga 

1741 actgtgagtc aattaaacct cttttcttca taaattaaaa aaaaa 



hPMSR3 (human protein) ACCESSION U38979 



MCPWRPRLGRRCMVSPREADLGPQKDTRLDLPRSPARAPREQNS 

LGEVDRRGPREQTRAPATAAPPRPLGSRGAEAAEPQEGLSATVSACFQEQQEMNTLQG 

PVSFKDVAVDFTQEEWRQLDPDEKIAYGDWLENYSHLVSVGYDYHQAKHHHGVEVKE 

VEQGEEPWIMEGEFPCQHSPEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKELVENSL 

DAGATN1DLKLKDYGVDL1EVSDNGCGVEEENFEGLISFSSETSH.M" 



hPMSL9 (human cDNA) ACCESSION NM00S395 

1 atgtgtcctt ggcggcctag actaggccgt cgctgtatgg tgagccccag ggaggcggat 

61 ctgggccccc agaaggacac ccgcctggat ttgccccgta gcccggcccg ggcccctcgg 

121 gagcagaaca gccttggtga ggtggacagg aggggacctc gcgagcagac gcgcgcgcca 

181 gcgacagcag ccccgccccg gcctctcggg agccgggggg cagaggctgc ggagccccag 

241 gagggtctat cagccacagt ctctgcatgt ttccaagagc aacaggaaac gaacacattg 

301 caggggccag tgtcattcaa agatgtggct gtggatttca cccaggagga gtggcggcaa 

361 ctggaccctg atgagaagat agcatacggg gatgtgatgt tggagaacta cagccatcta 

421 gtttctgtgg ggtatgatta tcaccaagcc aaacatcatc atggagtgga ggtgaaggaa 

4B1 gtggagcagg gagaggagcc gtggataatg gaaggtgaat ttccatgtca acatagtcca 

541 gaacctgcta aggccatcaa acctattgat cggaagtcag tccatcagat ttgctctggg 

601 ccagtggtac tgagtctaag cactgcagtg aaggagttag tagaaaacag tctggatgct 

661 ggtgccacta atattgatct aaagcttaag gactatggag tggatctcat tgaagtttca 

721 gacaatggat gtggggtaga agaagaaaac tttgaaggct taatctcttt cagctctgaa 

781 acatcacaca tgtaa 



hPMSL9 {human protein) ACCESSION NM005395 
MCPWRPRLGRRCMVSPREADLGPQKDTRLDLPRSPARAPREQNS 

LGEVDRRGPREQTRAPATAAPPRPLGSRGAEAAEPQEGLSATVSACFQEQQEMNTLQG 
PVSFKDVAVDFTQEEWRQLDPDEKIAYGDVMLENYSHLVSVGYDYHQAKHHHGVEVKE 
VEQGEEPWIMEGEFPCQHSPEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKELVENSL 
DAGATN1DLKLKDYGVDLIEVSDNGCGVEEENFEGLISFSSETSHM" 
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WE CLAIM: 

1 . A method for making a hypermutable bacteria, comprising the step of: 
introducing into a bacterium a polynucleotide comprising a dominant 

5 negative allele of a mismatch repair gene under the control of an inducible 
transcription regulatory sequence, whereby the cell becomes inducibly 
hypermutable. 

2. The method of claim 1 wherein the mismatch repair gene is MutH from 
any species. 

10 3. The method of claim 1 wherein the mismatch repair gene is a MutS 
homolog from any species. 

4. The method of claim 1 wherein the mismatch repair gene is a MutL 
homolog from any species. 

5. The method of claim 1 wherein the mismatch repair gene is a MutY 
1 5 homolog from any species. 

6. The method of claim 1 wherein the mismatch repair gene is PMS2. 

7. The method of claim 1 wherein the mismatch repair gene is plant PMS2. 

8. The method of claim 1 wherein the mismatch repair gene is MLH1. 

9. The method of claim 1 wherein the mismatch repair gene is MLH3. 
20 1 0. The method of claim 1 wherein the mismatch repair gene is MSH2. 

11. The method of claim 1 wherein the mismatch repair gene is a PMSR or 
PMSL homolog. 

1 2. The method of claim 3 wherein the allele comprises a truncation 
mutation. 

25 1 3. The method of claim 4 where the allele comprises a truncation 
mutation. 

14. The method of claim 6 where the allele comprises a truncation 
mutation. 

1 5. The method of claim 7 where the allele comprises a truncation 
30 mutation. 
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16. The method of claim 4 wherein the allele comprises a truncation 
mutation at codon 1 34. 

1 7. The method of claim 6 wherein the allele comprises a truncation 
mutation at codon 1 34. 

5 1 8. A homogeneous composition of cultured, hypermutable, bacteria which 
comprise a dominant negative allele of a mismatch repair gene under the 
control of an inducible transcriptional regulatory sequence. 
19. The homogeneous composition of claim 1 8 wherein the mismatch repair 
gene is a mutL gene or a homolog thereof. 
10 20. The homogeneous composition of claim 1 8 wherein the mismatch 
repair gene is PMS2 or a homolog thereof. 

21. The homogeneous composition of claim 18 wherein the mismatch 
repair gene is MLH1 or a homolog thereof. 

22. The homogeneous composition of claim 18 wherein the mismatch 
1 5 repair gene is PMSR or a PMSR homolog. 

23. The homogeneous composition of claim 1 8 wherein the mismatch 
repair gene is mutS or a homolog thereof 

24. The homogeneous composition of claim 1 8 wherein the mismatch 
repair gene is eukaryotic. 

20 25. The homogeneous composition of claim 1 8 wherein the mismatch 
repair gene is prokaryotic. 

26. The homogeneous composition of claim 20 wherein thebacteria 
comprise a protein which consists of the first 133 amino acids of PMS2. 

27. The homogeneous composition of claim 26 wherein the protein is 
25 human PMS2. 

28. The homogeneous composition of claim 23 comprising a mammalian 
MutS protein. 

29. The homogeneous composition of claim 19 comprising a mammalian 
MutL protein. 
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30. The homogeneous composition of claim 19 comprising a eukaryotic 
MutL protein. 

31. The homogeneous composition of claim 23 comprising a protein which 
5 consists of a eucaryotic MutS protein. 

32. A method for generating a mutation in a gene of interest comprising 
the steps of: 

growing a bacterial culture comprising the gene of interest and a 
dominant negative allele of a mismatch repair gene under the control of an 
10 inducible transcriptional regulatory sequence, wherein the cell is 
hypermutable; 

testing the cell to determine whether the gene of interest harbors a 
mutation. 

33. The method of claim 32 wherein the step of testing comprises 
1 5 analyzing a nucleotide sequence of the gene of interest. 

34. The method of claim 32 wherein the step of testing comprises analyzing 
mRNA transcribed from the gene of interest. 

35. The method of claim 32 wherein the step of testing comprises analyzing 
a protein encoded by the gene of interest. 

20 36. The method of claim 32 wherein the step of testing comprises analyzing 
a phenotype associated with the gene of interest. 

37. The method of claim 32 wherein bacteria in the bacterial culture are 
made by the process of introducing a polynucleotide comprising a dominant 
negative allele of a mismatch repair gene into a bacterium, whereby the cell 

25 becomes hypermutable. 

38. The method of claim 37 wherein the step of testing comprises analyzing 
the nucleotide sequence from the gene of interest. 

39. The method of claim 37 wherein the step of testing comprises 
analyzing a protein encoded by the gene of interest. 
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40. The method of claim 37 wherein the step of testing comprises analyzing 
a phenotype associated with the gene of interest. 

41. A method for generating a mutation in a gene of interest comprising the 
steps of: 

5 growing a bacterium comprising the gene of interest and a dominant 

negative allele of a mismatch repair gene under the transcriptional control 
of an inducible regulatory sequence to form a population of mutated 
bacteria; 

cultivating the population of mutated bacteria under trait selection 
10 conditions; and 

testing at least one of the cultivated bacteria to determine whether 
the gene of interest harbors a mutation. 

42. The method of claim 41 wherein the step of testing comprises 
analyzing a nucleotide sequence of the gene of interest. 

15 43. The method of claim 41 wherein the step of testing comprises 
analyzing mRNA transcribed from the gene of interest. 

44. The method of claim 41 wherein the step of testing comprises 
analyzing a protein encoded by the gene of interest. 

45. The method of claim 41 wherein the step of testing comprises 
20 analyzing a phenotype associated with the gene of interest. 

46. The method of claim 41 further comprising the step of growing a 
cultivated bacteria harboring a mutation in the gene of interest to 
manufacture a protein produced by said bacteria and harvesting the 
protein therefrom. 

25 47. The method of claim 41 further comprising the step of growing a 
cultivated bacteria harboring a mutation in the gene of interest to 
biotransform a substrate. 
48. The method of claim 4 1 further comprising the step of growing a 
cultivated bacteria harboring a mutation in the gene of interest to 

30 bioremediate. 
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49. The method of claim 41 wherein the gene of interest is an antibiotic 
resistance determinant and further comprising the step of comparing the 
genome of the cultivated bactereia harboring a mutation in the 
antibiotic resistance determinant to a genome of a corresponding wild 



50. The method of claim 41 further comprising the step of growing a 
cultivated bacteria harboring a mutation in the gene of interest to screen 
effects of a compund library. 

51. A method for enhancing the mutation rate of a bacterium, 



exposing a bacterium comprising a dominant negative allele of an 
MMR gene to a mutagen whereby the mutation rate of the bacterium is 
enhanced in excess of the rate in the absence of mutagen and in excess of 
the rate in the absence of the dominant negative allele. 
1 5 52.The method of claim 5 1 wherein the mutagen is a chemical mutagenic 
agent. 

53. The method of claim 5 1 wherein the mutagen is a DNA alkylating 
agent. 

54. The method of claim 5 1 wherein the mutagen is a DNA intercalating 
20 agent. 

55. The method of claim 5 1 wherein the mutagen is a DNA oxidizing agent. 

56. The method of claim 5 1 wherein the mutagen is ionizing radiation. 

57. The method of claim 51 wherein the chemical mutagen is ultraviolet 
radiation. 

25 58. A method for generating an MMR-proficient bacterium with a new 
output trait, comprising: 

growing a mismatch repair deficient bacterium comprising a 
defective mismatch repair gene allele and a gene of interest, to form a 
population of mutated bacteria; 



5 



type strain, thereby identifying the gene of interest. 



10 



comprising the steps of: 
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cultivating the population of mutated bacteria under trait selection 
conditions; 

testing at least one of the cultivated bacteria to determine that the 
gene of interest harbors a mutation; and 
5 restoring mismatch repair activity to the at least one cultivated 

bacteria. 

59. The method of claim 58 wherein the mimatch repair deficient bacterium 
comprises a dominant negative allele of a mismatch repair gene. 

60. The method of claim 59 where in the step of restoring MMR activity 
1 0 comprises removing an inducer chemical that positively regulates the 

dominant negative MMR allele's expression. 

61 . The method of claim 59 where in the step of restoring MMR activity 
comprises excising the dominant negative gene by recombination. 

62. The method of claim 59 where in the step of restoring MMR activity 
1 5 comprises knocking out the MMR dominant negative gene allele. 

63. The method of claim 59 where in the step of restoring MMR activity 
comprises negatively selecting for loss of the dominant negative allele from 
bacterial host. 

64. The method of claim 58 wherein the mismatch repair deficient 

20 bacterium is treated with a mutagen to enhance the rate of mutation. 

65. The method of claim 58 wherein the mismatch repair deficient 
bacterium is treated with a chemical mutagen to enhance the rate of 
mutation. 

66. The method of claim 58 wherein the mismatch repair deficient 
25 bacterium is treated with ionizing radiation to enhance the rate of 

mutation. 

67. The method of claim 58 wherein the mismatch repair deficient 
bacterium is treated with ultraviolet (UV) irradiation to enhance the rate 
of mutation. 
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68. The method of claim 58 wherein the step of restoring MMR activity 
comprises complementing with a wild-type MMR allele. 

69. The method of claim 5 1 wherein the dominant negative allele is under 
the control of an inducible promoter. 

5 70. The method of claim 58 wherein the dominant negative allele is under 
the control of an inducible promoter. 
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